PREFACE TO THE REVISED EDITION


WHILE DATA ON LEARNING accumulate with dizzying rapidity, the 
theories and the controversies about learning develop more slowly. 
Eight years after the first edition of Learning, the contrast between 
connectionist and cognitive interpretations remains central to learn- 
ing theory, and the convergence of the two continues. However, 
the details of this convergence appear somewhat different from 
those in 1963, and the new edition reflects these differences. 

The first half of the book is little changed from the first edition, 
except for an increase in the space devoted to Skinner, reflecting 
his increasing prominence in recent years. Most of the changes are 
in the latter half of the book. Sections have been added on frustra- 
tion theory and on computer-based models, the sections on math- 
ematical models and on mediation have been expanded, and the 
organization has been markedly revised. The result is hopefully a 
well-balanced integration of the important “classical” theories, the 
current theoretical developments, and the future potential of learning
theory.


WINFRED F. HILL 


Evanston, Illinois 
May, 1971 
PREFACE TO THE FIRST EDITION


SYSTEMATIC INTERPRETATIONS OF learning have an important place 
both in the science of psychology and in the application of psy- 
chology to education. This introductory survey of contemporary 
learning theories is designed to provide a fairly elementary but 
nonetheless solid account of this topic for students in the psychol- 
ogy of learning and in educational psychology. 

In planning this book particularly for educators, I have tried 
to point up educational and related applications. However, I have 
been particularly concerned with what J. M. Stephens calls the
“teacher as a theorist,” so that much of the emphasis is on learn- 
ing theory as an attempt to provide basic understanding of the 
learning process. The discussions in this book are therefore fully 
as appropriate for students in the liberal arts as for those in edu- 
cation. 

In attempting to classify psychological interpretations of learn- 
ing, I find the familiar division into connectionist and cognitive 
theories still the most useful. However, much of the interest in this 
classification is in the ways that these two kinds of theory have 
been moving closer together. The latter half of this book has as one 
of its main themes the attempts that have been made to combine 
the advantages of these two approaches. 

Another basis of classification, especially significant for the be- 
ginning student, is the degree of formality of the theory, varying 
from the chattiest of verbal interpretations to the most abstract 
of mathematical models. The former kind are both easier for the 
student and in most cases more oriented toward practical applications,
but a balanced picture of learning theory requires that
both be considered. In general, the discussion in this book pro- 
ceeds from the more simple and practical to the more complex 
and abstract. Chapter 1 provides an introduction to learning the- 
ory and its uses. Chapters 2 through 4 deal with relatively non- 
technical interpretations of learning, oriented toward educational 
and other applications. Chapters 5 and 6 treat more technical in- 
terpretations, and may be omitted if only a simple and practically- 
oriented coverage is desired. The final two chapters explore various 
aspects of current learning theory, highlighting unsettled issues and 
attempting to predict future developments.

Special thanks are due to Ernest R. Hilgard, both for his thorough
critical reading of the manuscript and for his many contributions
to my thinking about learning theory. I also wish to thank
David Ryans and Ruth Wylie for their many helpful comments, 
Elaine Johnson and Irene Nolte for their efforts in typing the manu- 
script, and my wife, Libby, for help in all stages of preparation. 


WINFRED F. HILL 
Evanston, Illinois 


January, 1963 
a survey of psychological
interpretations 


UNDERSTANDING 
AND EXPLAINING 
LEARNING 


THEORETICAL SYSTEMS FOR the interpretation of learning have 
grown up along with the experimental study of learning, and to 
understand one it is necessary to understand the other. Although 
the experimental study of learning to a large extent grew out of 
everyday problems, particularly problems in education, it soon 
became involved with theoretical issues and with the experiments 
appropriate for dealing with them. This theoretical involvement 
proceeded to such an extent that the original problems became 
barely recognizable. It is therefore worthwhile to take a long look 
at the interrelations among theory, research, and applications in 
the field of learning before undertaking a detailed survey of differ- 
ent theoretical systems. 

The reader should be warned at the outset that psychologists 
use the term “learning” more broadly than it is used in popular 
speech. While it is almost impossible to give an exact definition
of learning that will be generally acceptable to psychologists, we
can at least note certain phenomena to which the term is or is not
applied. In psychological usage, what is learned need not be “correct”
or adaptive (we learn bad habits as well as good), need not
be conscious or deliberate (one of the advantages of coaching in
a skill is that it makes us aware of mistakes we have unconsciously
learned to make), and need not involve any overt act (attitudes
and emotions can be learned as well as knowledge and skills).
Reactions as diverse as driving a car, remembering a pleasant vacation,
believing in democracy, and disliking one’s boss all represent
the results of learning.


How does learning take place? What factors determine what 
we will learn and how rapidly we will learn it? There are innumer- 
able people in situations in which it would be useful to have an- 
swers to these questions. We think at once of students looking for 
better methods of study, of teachers wanting to improve their 
classroom techniques, and of people in industry seeking better 
ways of training new workers. We may also consider the mother 
looking for the best way of raising her children, the counselor 
trying to improve his client’s emotional and social adjustment, the 
animal trainer preparing seeing-eye dogs for their work, and the 
advertiser trying to develop a preference on the part of consum- 
ers for his client’s product. In all of these cases, knowledge about 
the learning process represents power. 

The above practical needs are not the only reasons for want- 
ing to know more about learning. Man has always been curious 
about himself, always wanted to know more about what “makes
him tick.” Since learning is such an important factor in what man 
is and does, a greater understanding of the learning process 
would go far toward increasing his self-knowledge. Consequently 
people are motivated to study learning, not only by the practical 
benefits to be gained but also by curiosity about themselves and 
how they came to be as they are. 


LEARNING IN SCHOOL 


Let us begin our discussion by looking at a setting in which 
learning is the primary focus: the school. A child in school faces a 
bewilderingly complex learning situation. It is complex from his 
point of view; it is even more so from the point of view of the psy- 
chologist who bravely attempts to analyze it. The child is influ- 
enced in countless ways by the varied aspects of the classroom 
situation. He learns much from the teacher, including many things 
not prescribed in the curriculum and some things of which neither 
teacher nor pupil is aware. He also learns from his books, from 
his fellow students, and from the physical arrangements of the 
school. Part of what he learns is measurable as specific knowl- 
edge and skills, while another part involves changes, some very 
subtle but a few quite dramatic, in attitudes, emotions, social behavior,
and a variety of other reactions. The psychologist’s job is
to analyze such complex situations into their component parts and 
to try to understand the principles of learning and motivation in- 
volved. 

Let us consider some incidents in the school day of a particu- 
lar sixth-grader, Alex B. We meet Alex first as he is studying a 
vocabulary list in his reading book. His teacher (a trifle old- 
fashioned) has instructed the class to learn to spell these words. 
He is thus confronted with a list of twelve words for which 
he must learn to give the correct written form when presented 
with the spoken form. However, in studying from his book 
he faces a slightly different task, since only the written form is 
before him and he must provide the spoken version (subvocally) 
himself. He goes through the list, pronouncing and spelling the 
words to himself, but finds his attention wandering. He looks out 
the window and remembers how much he enjoyed the previous 
weekend. Snapping back to the spelling book, he continues duti- 
fully to the end of the list. Then he tries covering all but the first 
letter of each word with his hand and trying to spell the word to 
himself. At “r” he proceeds cautiously, trying to remember the rule 
for “i” and “e.” Finally he decides on “receive,” moves his hand 
away, and relaxes a bit as he finds his guess confirmed. At “s” he 
spells to himself without hesitation, “seperate,” then frowns as he
sees his mistake. He stares for a moment at the “ar,” trying to
fix it in his memory, then continues through the list. Halfway
through, he finds himself unable to remember a word from the 
first letter. 

Annoyed, he looks at the word and then concentrates heavily 
on trying to remember it. Two words later, the same thing happens
again. He looks at the word, wonders why it should be hard to
remember, and begins to feel discouraged. He stares at the picture
on the wall of Washington crossing the Delaware, and imagines
himself leading such an expedition. Catching the teacher’s eye, he
returns quickly to his book and finishes the list. He spells the last
word confidently, “cematary,” only to find “cemetery” staring
back at him from the page. Puzzled, he looks back and forth from
“separate” to “cemetery” and thinks, “I hate spelling.”

We see Alex next at recess running excitedly out to join a softball
game. He begins the game in the outfield. As the batter hits a
ground ball in his direction, he starts running, his speed and direc- 
tion adjusted to make his path intercept that of the ball. As 
he nears the ball he bends over, puts his hands in front of it, and 
closes them at just the right moment to catch the ball. He 
then looks up, notes the position of the runner, and throws the ball 
in the direction of first base, his arm motion just forceful enough 
to carry the ball to the first baseman. This whole sequence of co- 
ordinated, purposeful behavior occurs rapidly, almost automat- 
ically, with no evidence of thought or verbal self-instruction. 
Watching him, the teacher is impressed by how much Alex has 
improved at softball since she has known him. 

During the game, Alex calls frequently, “Wait’ll I get to bat!”
and “When’m I up?” Finally his turn comes. He pounds the plate 
and shouts, “Put it over!” When the pitcher throws the ball, Alex 
takes a mighty swing at it, and misses. He looks chagrined, but 
shouts, “Let’s have another!” Again he swings and misses. This
time he frowns and says nothing. He stands more rigidly at the 
plate, his teeth clenched. His swing at the third pitch is more 
tense, less free, and again he misses. He throws the bat down and 
stalks away. At his next turn in the outfield, he does no shout- 
ing, and his fielding is less coordinated. When recess ends, he re- 
turns quietly to the classroom, ignoring the chatter around him. 
He seems relieved that recess is over. 

Later we see Alex during his math lesson. The class has been 
learning how to find the areas of rectangles. Now the teacher raises 
the problem of how to find the area of a triangle. She draws a right 
triangle with a 4-inch and a 3-inch side on the blackboard and asks the
students what the area is. Alex likes math and enjoys learning 
how to solve new problems, so he eagerly tries to figure this one 
out. He tries to apply the rule for rectangles, but can’t see how to 
do it. Then the teacher draws two more lines, making a rectangle 
with the hypotenuse of the triangle as its diagonal. Alex stares at 
this for a moment, then grins and excitedly holds up his hand. 
“The area is six. The whole rectangle is 12, and there are two tri- 
angles, and each triangle is half of the rectangle!” The teacher 
smiles. “Very good. The area is half of 4 times 3. Math makes a 
lot of sense when you figure it out like that.” Alex basks in his success,
pleased both with the teacher’s approval and with his triumph
over the impersonal challenge of the problem.

We leave Alex now, happy that a rather frustrating day for him 
has ended with success. However, the challenge to a psychologist 
to analyze Alex’s experiences lasts long after the incidents 
are over. In fact, the narrative of Alex’s day might be used as a 
hidden-picture puzzle: “How many aspects of learning can you 
find illustrated in these incidents?” Consider, for example, the 
similarities and differences among the three situations in which 
we saw Alex. All involved learning, motivation, goal-directed be- 
havior, and success or failure. All could be analyzed in terms of 
the responses that Alex made to various stimuli in his environment, 
and also in terms of the way he perceived the environment. All 
occurred in social situations and against a background of norms 
and values shared (though not completely) by Alex, his teacher, 
and his fellow students. However, the three situations differed in 
the principal kind of learning involved (verbal memorization of 
the spelling list, motor skill on the softball field, and logical in- 
sight in the math problem). The situations also differed in the 
emotional learning that occurred and in some of the subsidiary 
principles. In each situation we may ask what factors contributed 
to Alex’s successes and failures. How much did his difficulty in 
spelling “cemetery,” for example, depend on the word itself, on
its position in the list, on the fact that “separate” came earlier in 
the same lesson, on distractions in the room at that moment, or 
on a variety of other facts about Alex, the lesson, and the gen- 
eral situation? There are endless possibilities for the psychologist 
to analyze in these three everyday situations. 

However, such analysis assumes that we already know a great 
deal about the principles of learning. In cases where a number of 
factors might have influenced Alex’s behavior, we cannot make 
worthwhile guesses about how important each was unless we have
information about how important each has been in other similar
situations. For example, we cannot judge whether the word “cemetery”
itself or the situation in which Alex encountered it contributed
more to his error unless we know something about the relative
importance of words and their context in other cases of
memorization. It is to provide such information about the regularities
in behavior that we do experimental studies. If we know
from such studies that certain situations cause special difficulty 
in memorizing, we are in a much better position both to explain 
Alex’s errors and to find ways of helping him. 


LEARNING IN THE LABORATORY 


Let us now look at some typical laboratory learning situations. 
As compared with Alex’s experiences, some of these may appear 
artificial or trivial. This difference is the price we pay in order to 
have situations in which we can manipulate certain variables, hold 
others constant, and measure precisely the resultant changes in
behavior. We will see many relations between the complex situ- 
ations in which Alex learned and these simpler, better controlled, 
laboratory learning situations. 


Two Complex Learning Situations 


able to help him do the job is a pair of pliers. Can he find a way


(There need not be an actual room 
solution can all be given with pen- 
at the closest point of its arc. Since the subject is now holding both
strings, he can easily tie them together. 

How does this laboratory problem differ from the geometric 
problem that Alex solved in the classroom? In principle, very lit- 
tle. Both have simple solutions drawing on the solver’s past ex- 
perience, but both are difficult because they require the use of this 
past experience in new ways. The solution in each case usually 
involves a period of futile search followed by a flash of insight. 
A hint, such as the two extra lines the teacher drew, or the word
“pendulum” in the two-string problem, often results in quick so- 
lution of the problem. And although we have spoken of these as 
problem-solving situations, they are quite impressive as learning 
situations, since the ability to tie the strings or compute the area 
not only appears suddenly but is also very well retained as com- 
pared with most other learned material. 

The important difference is not between the two problems but 
between the classroom and the laboratory. In the classroom case, 
we do not know how soon (if at all) Alex would have solved the 
problem without the teacher’s hint, or why Alex rather than an- 
other student solved it first, or how many other students would 
have found the solution if he had not beaten them to it, or how 
the presence of other students helped or hindered him. In the labo- 
ratory, on the other hand, all of these questions can be systemati- 
cally studied. The problem can be presented to subjects singly or 
in groups, with hints at predetermined points, and each person's 
time to solution (and his mistaken “solutions”) recorded. This 
laboratory situation makes it possible to investigate both sets of 
factors—those in the situation and those in the individual—that 
contribute to this learning. (If the reader feels uncomfortable 
about calling such problem solving “learning,” since it does not involve
improvement with practice, he should note that a major
theoretical approach to learning is built around just such “percep- 
tual restructuring” as is involved in solving these problems.) 

Another laboratory example of learning, closer to the popular 
image of a learning situation, involves rote memorization. The 
items to be memorized are usually either words or nonsense syl- 
lables (such as deg or czr), and the task may either be to learn a 
series of them in order (as would be the case in memorizing a
chemical formula) or to learn a number of pairs of items with one
member of a pair always given as the response to the other (com- 
parable to learning a foreign-language vocabulary). The items are 
typically presented in a memory drum, a machine that displays 
them one by one at a controlled rate through a small window. As 
each item appears in the window, the subject tries to say what the 
next item will be. The whole series of items is repeated again and 
again until the learner is able to anticipate each item correctly be- 
fore it appears. 
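
The scoring of such an anticipation procedure can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the function names, the four-item list (the last two syllables are invented for the example), and the three trials of responses are all hypothetical, and the criterion of one perfect pass through the list is taken from the description above.

```python
def score_trial(items, anticipations):
    """Score one pass through the list.

    anticipations[i] is what the subject said while items[i] was in the
    window, that is, his guess about items[i + 1]. Returns one True/False
    per anticipated item.
    """
    return [guess == nxt for guess, nxt in zip(anticipations, items[1:])]


def trials_to_criterion(items, trials):
    """Return the 1-based number of the first trial on which every item
    was anticipated correctly, or None if that never happened."""
    for n, anticipations in enumerate(trials, start=1):
        if all(score_trial(items, anticipations)):
            return n
    return None


# Hypothetical list: "deg" and "czr" come from the text, the rest are invented.
items = ["deg", "czr", "tov", "wub"]

# Hypothetical responses on three successive passes through the list.
trials = [
    ["tov", "tov", "deg"],
    ["czr", "wub", "wub"],
    ["czr", "tov", "wub"],
]

# Continuous record of mastery: one row per trial, one column per item.
record = [score_trial(items, t) for t in trials]
for n, row in enumerate(record, start=1):
    print(n, row)
print("trials to criterion:", trials_to_criterion(items, trials))
```

Because every item is tested on every pass, the record can be read by rows (stages of learning) or by columns (individual items and list positions).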

The contrast between such a learning task and Alex’s spelling 
lesson is again largely in the precision of control that the labora- 
tory setup permits. Alex could approach the words to be learned 
in any order and divide his time among them in any way he chose. 
Moreover, he had to study from a book, even though the test of 
his mastery would consist of writing the words in response to oral 
dictation. This difference forced him to devise techniques of practice
that would as nearly as possible match the test procedure. Finally,
he was subjected to the distractions of the classroom situation.
The laboratory subject, on the other hand, sees the items in
a predetermined order and for constant amounts of time in a small 
room with a minimum of distractions. Since he is automatically 
tested on each item each time through the list, there is a continuous 
record of his mastery of each item at each stage in the learning. 
This record makes possible separate analyses of different stages of 
learning, different parts of the list, and different individual items. 


Three Simpler Learning Situations 


Both the problem-solving and the rote learning situations in 
the laboratory represent refinements of learning situations found 
in the classroom and elsewhere in everyday life outside the labora- 
tory. However, many psychological experimenters analyze these and 
other examples of learning (e.g., Alex’s learning of the motor skills 
involved in fielding a softball) into simpler components and then 
study these components separately. Rather than working with 
complex patterns of responses such as Alex learned, they study 
changes in the frequency or magnitude or speed of a single re- 
sponse. Their object is not to study everyday kinds of learning 
under better-controlled conditions, but to study the underlying
components of everyday learning in order to get a better understanding
of what learning is. Such understanding, it is hoped, can
then be used to predict learning in a great variety of more com- 
plex situations. This approach is comparable to that in other sci- 
ences, where complex chemical substances are analyzed into their 
component elements, or where the speed of a falling body is stud- 
ied as a function of the distance it has fallen, without regard to 
whether the body is a mail bag, a person, or a hydrogen bomb. 

Since research of this sort attempts to reduce learning to its sim- 
plest essentials, animals have often been found more satisfactory 
subjects than humans. Animals lack the complexities of language 
and cultural traditions, their background of previous experience is 
both simpler and (at least if they are reared in the lab) less variable 
than that of humans, and their heredity as well as their rearing can 
to a considerable extent be controlled. All of these factors make 
it easier to study one aspect of the animal’s learning processes at 
a time while holding others constant. For example, if we want to 
study the rate at which a dog learns to flex his leg to avoid an elec- 
tric shock, we probably do not have to worry about his interpre- 
tation of the purpose of the experiment or about his concern with 
behaving bravely, politely, or rationally, all of which would be 
factors if we were studying the same behavior in humans. This is 
not to say that humans cannot usefully be studied in such situa- 
tions, for they often are, but to point out why for certain purposes 
animals are often preferred. 

One simple animal response which has been extensively stud- 
ied is the pecking response in pigeons. If a hungry pigeon is occa- 
sionally rewarded with food for pecking a lighted key on the side 
of its cage, it will peck the key at a high rate. This pecking rate is 
quite sensitive to changes in level of hunger, frequency of food re- 
ward, and other variables. It can thus be used to investigate many 
of the phenomena of simple learning. 

The low rate at which the pigeon pecks before food is in- 
troduced is called the operant rate. When food first begins to be 
presented as soon as the pigeon pecks the key, the rate of pecking 
rises. In technical terms, this change indicates that the food rein- 
forces the pecking, or that it is a reinforcer. If some but not all 
pecks are followed by food, the particular pattern used (e.g., 
food after every tenth peck, or after one peck each minute)
is known as the schedule of reinforcement. If pecking is no longer 
followed by food, the rate drops. This decline in rate resulting 
from the removal of the reinforcer is known as extinction. If after 
extinction takes place there is an interval of time during which 
the pigeon does not have access to the key, and then the key is pre- 
sented again, the pecking rate is likely to be higher than it was at 
the end of extinction. This increase in rate is called, naturally 
enough, spontaneous recovery. If a key that the pigeon has been 
reinforced for pecking is replaced by another key of a different 
color, the pigeon will peck at this key also, though not as much 
as at the original one. This tendency to respond to stimuli other 
than the one used in training is known as generalization. If, how- 
ever, the two keys are presented alternately, with pecking at one 
reinforced and pecking at the other nonreinforced (extinguished), 
the pigeon will learn to peck the reinforced key at a high rate and 
the nonreinforced key very little. We then say that a discrimination 
between the keys has been formed. 
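
The two schedules of reinforcement mentioned above (food after every tenth peck, or after one peck each minute) can be written out as a short Python sketch. The labels "fixed ratio" and "fixed interval" are the usual technical names for such schedules but do not appear in the text; the function names, the steady pecking rate of one peck every two seconds, and the convention that the first interval is timed from the start of the session are assumptions made for illustration.

```python
def fixed_ratio(n):
    """Schedule that reinforces every n-th peck."""
    count = 0

    def schedule(peck_time):
        nonlocal count
        count += 1
        if count == n:
            count = 0
            return True
        return False

    return schedule


def fixed_interval(seconds):
    """Schedule that reinforces the first peck made once `seconds` have
    passed since the last food (timed from 0 for the first interval)."""
    last_food = 0.0

    def schedule(peck_time):
        nonlocal last_food
        if peck_time - last_food >= seconds:
            last_food = peck_time
            return True
        return False

    return schedule


def reinforced_pecks(peck_times, schedule):
    """Return the times of the pecks that were followed by food."""
    return [t for t in peck_times if schedule(t)]


# Assumed behavior: 100 pecks, one every 2 seconds (at 2, 4, ..., 200 s).
peck_times = [2.0 * i for i in range(1, 101)]

fr10 = reinforced_pecks(peck_times, fixed_ratio(10))      # every tenth peck
fi60 = reinforced_pecks(peck_times, fixed_interval(60.0))  # one peck a minute

print(len(fr10), "reinforcements on the ratio schedule")
print(fi60, "are the reinforced peck times on the interval schedule")
```

Extinction corresponds, in this sketch, to a schedule that never returns True. The sketch describes only which pecks are followed by food; how the pigeon's pecking rate changes in response is the empirical question the experiments address.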

Many aspects of the above learning phenomena have been stud- 
ied experimentally. How does the rate of pecking vary with the 
amount of food given, the schedule of reinforcement, or the delay 
between peck and food? How does spontaneous recovery vary with 
the time interval between extinction and test, or generalization 
vary with the difference in color between the two keys? Will mak- 
ing a pigeon more hungry increase or decrease his ability to form 
a discrimination? Questions of this sort may be studied, not only 
with regard to the rate of key-pecking in pigeons, but with re- 
gard to many other forms of behavior in a variety of species. 

These learning phenomena, established in experimental studies, 
can be applied (though often with some modification) to com- 
plex learning in everyday life. Though we did not see Alex over 
a long enough period of time to observe the effects of reinforce- 
ment and extinction in his behavior, we can note a number of cases 
where these processes were probably occurring for him. His state- 
ment about the area of the triangle was reinforced by the teacher’s 
praise, his techniques of fielding the softball by the approval of 
his fellow players, and his correct spellings by seeing his spellings
confirmed by the printed words. On the other hand, his incorrect
spellings, his batting techniques, and his competent manner
with regard to batting were not reinforced and hence presumably 
underwent some extinction. The disappearance of his confident 
shouting after he struck out might be taken as evidence of extinc- 
tion. If the shouting reappeared at his next softball game, this re- 
newal could be interpreted as spontaneous recovery. The prob- 
lem of spelling “separate” with an “ar” and “cemetery” with an 
“er” clearly involved making a discrimination between two simi- 
lar situations, and it might also be interpreted as extinguishing the 
tendency to spell phonetically. 

Not only can we identify these learning processes operating in 
everyday life; we can also make predictions from learning exper- 
iments to everyday learning situations. For example, it is commonly 
found in experiments that a delay between the response and the 
reinforcer results in poorer performance of the learned response. 
From this finding we could make predictions about the efficacy of 
different teaching methods. If Alex had taken a spelling quiz on 
one day but had not found out which words he got right until the 
papers were handed back the next day, he probably would have 
learned less spelling than he did with his book, in consequence 
of the delay of reinforcement. On the other hand, it is possible 
that the opposite might be true, either because correctness on a 
graded test is more reinforcing than correctness on a private self- 
test, or because writing from dictation is a better way of studying 
spelling, or for a variety of other reasons. This example thus points 
up both the usefulness of laboratory study in making suggestions 
about everyday learning and the danger in taking such suggestions 
uncritically. 

A fourth example of laboratory learning, and one even less like 
the ordinary picture of a learning situation, occurs in experiments 
on classical conditioning. The first of these, and still the most fa- 
mous, were those in which the Russian physiologist Ivan Pavlov 
used the procedure to train dogs to salivate. Another version of 
this procedure (less famous than Pavlov’s salivating dogs, but cur- 
rently more popular in American labs) is the conditioning of the 
eyeblink reflex in humans. If a person who is watching a dim light 
sees the light grow somewhat brighter, he ordinarily does not 
blink his eyes in response to this stimulus. If, however, he is hit 
in the eye by a vigorous puff of air, he does blink. The conditioning
procedure consists in pairing these two stimuli, with the brightening
of the light coming a fraction of a second before the puff of
air. Each time this sequence occurs, the subject blinks in response
to the air puff. Presently, however, he begins to blink as soon 
as the light changes, before the puff comes. Since the changing 
light now produces a blinking response which it formerly did 
not produce, learning has taken place. In this setup the puff, which 
already produced blinking, is called the unconditioned stimulus, 
and blinking to the puff is the unconditioned response. The 
increase in brightness of the light is called the conditioned stimu- 
lus, and the learned response of blinking to it is the conditioned re- 
sponse. The whole learning sequence is known as conditioning. The 
same principles of reinforcement, extinction, spontaneous recovery,
generalization, and discrimination that were illustrated by
the pigeon pecking a key can also be demonstrated in classical 
conditioning situations. 

The distinctive characteristic of this kind of learning is that a 
certain stimulus elicits a response after the learning experience 
that it did not elicit before. (Because some authorities use the 
term “conditioning” rather broadly, the adjective “classical” is 
often added to “conditioning” to make clear that this form of learn- 
ing is meant.) Since classical conditioning does not involve learn- 
ing new responses, solving problems, or carrying out goal-directed 
activity, it seems at first glance to be of little importance com- 
pared to the other forms of learning so far discussed. Some theorists 
have concurred in this judgment, regarding conditioning as noth- 
ing more than a laboratory curiosity. Others, as we shall see, have 
regarded it as of central importance. One modern viewpoint is 
that its importance is mainly for the learning of emotional reactions.
When Alex experienced failures at spelling and at batting,
he reacted with discouragement and anger. These can be considered
unconditioned re

failure. When he

tudes toward all the people, objects, and situations we encounter
in our lives. 

A fifth and final laboratory example of learning is the maze. 
The principles involved in maze learning are similar to those in 
the key-pecking experiments already mentioned, but the maze has 
been used so widely in learning experiments that it deserves sepa- 
rate mention. The original mazes were modeled after the Hamp- 
ton Court maze in England, where tall hedges formed an intri- 
cate pattern of winding pathways through which visitors tried to 
find their way. Miniature versions of this maze were used to study 
the ways in which both humans and animals learned to avoid the 
many blind alleys and find the correct path from the starting point 
to the goal. Over the years mazes have become simpler and simp- 
ler as psychologists tried to find the basic principles involved in 
learning them. The winding pathways of the Hampton Court maze 
have been replaced by alleys with simple series of left and right 
turns. Even these, however, have been considered too complicated, 
and have been largely replaced with “mazes” in which there is 
only a single choice to be made, between a right and a left turn. 
Because of their shape, these are known as T mazes. These are 
too easy to be useful with humans, but are widely used with ani- 
mals. The learner (typically a well-domesticated white rat) runs 
along the stem of the T and then enters one of the arms. If he 
chooses the correct one, he finds food at the end of the arm. The 
rate at which he learns which side is correct can be studied in re- 
lation to the amount of food he gets, how hungry he is, and other 
more complex variables. Finally, even the one choice point may be 
eliminated, leaving only a straight runway. Here one can study 
only how fast the animal runs and whether he runs at all. Sim- 
ple as it is, the runway has proved valuable for studying many 
aspects of reinforcement, extinction, and even discrimination. 


Advantages of Laboratory Study 


Hopefully, the preceding examples have helped to indicate 
the relation between the sorts of learning situations studied in the 
laboratory and the sorts of most interest to educators and to others 
directly concerned with the problems of learning in everyday life. 
However, it may be well to discuss certain aspects of the relationships
more explicitly. Two main points need to be considered:
(1) what psychologists gain by studying learning under the somewhat
artificial conditions of the laboratory, and (2) what difficulties
are involved in applying the results to other, nonlaboratory,
situations.


remembering only what he 
nvinced that a certain new 
ill work better than the old 
striking successes with the 


they are should be decided 
n, not by spur-of-the-moment intuition. If 


Essentially, this gai 
thing at a time. When 
so that it will not interfere with our studying another variable.
Suppose, for example, that we want to find out whether it is easier
to learn spelling when words that are likely to cause confusion with 
one another (e.g., “separate” and “cemetery”) are in the same les- 
son or in different lessons. We can study this problem by prepar- 
ing two sets of lessons, one according to each principle, and using 
them with two sets of students. However, we must see to it that 
the two sets of students do not differ in intelligence, interest in spell- 
ing, previous experience with these words, or motivation to do well 
in their studies. We must also see that the groups are taught by teach- 
ers who do not differ in teaching ability or in enthusiasm. In other 
words, we must control intelligence, motivation, and all the other 
above variables except arrangement of the words in the spelling les- 
sons. Only then can we be confident that any difference we find in 
mastery of the spelling lessons is due to arrangement of the words ra- 
ther than to some other difference between the two sets of students. 
Because the practical demands of the classroom make it difficult 
to obtain such control, or even to know whether one has obtained 
it, it is valuable to have laboratories specially arranged for doing 
well-controlled experiments. 

Though laboratory studies of learning provide great advantages 
in measurement and in control, we must not suppose that they pro- 
vide easy answers to questions about the practical management of 
learning. Because they typically study single variables out of their 
usual context, laboratory experiments can seldom give direct an- 
swers to questions about how these variables work together in 
that context. For example, it has been established in a number of 
experiments that larger reinforcers lead to better performance 
in a learning situation. Does it follow that the more lavishly a 
teacher praises students for their successes, the better the students 
will do their work? Perhaps, but there are various reasons why 
this result might not follow. The students might become so used 
to this effusive praise that it would soon be no more reinforcing to 
them than mild praise would be to other students. Or, perhaps, 
those who failed to win the praise might be all the more frustrated 
because the praise they missed was so desirable, which might 
result in more disappointment and anger becoming conditioned 
as responses to the stimuli of the whole situation. Moreover, neither
the reinforcers nor the learning tasks used in the laboratory
are likely to be the same ones used in the classroom (in fact, many 
of the studies on reinforcement magnitude were done with ani- 
mals); this kind of difference will probably make the outcomes 
different in degree, and possibly in kind. Finally, even if the sub- 
jects, tasks, and reinforcers are all similar in the laboratory and in 
the classroom, the difference between working alone in an unfa- 
miliar situation (the laboratory) and working in a familiar group 
situation (the classroom) may produce considerable differences in 
behavior. For all of these reasons we would be rash to generalize 
from the laboratory studies to the classroom application. 


VARIABLES, LAWS, AND THE PROCESS 
OF ABSTRACTION 


Researchers in the field of learning, like those in any other 
branch of science, are concerned with discovering scientific laws. 
All the experimental procedures we have discussed are directed 
toward such discovery. A law is a statement about the conditions 
under which certain things occur. Some laws are highly precise 
and accurate, as in the physicist’s statement that the period of a 
pendulum is proportional to the square root of its length. Other 
laws are much less precise and much more subject to error, as 
when the amateur weather prophet maintains, “Red at night,
sailor’s delight; red in the morning, sailor’s warning.” In both cases, 
however, we are being told that certain events occur under certain
conditions. Given those conditions, we can predict that these
events will occur. The prediction need not always be correct, so
long as it is correct often enough to be useful. If bad weather occurs
75 per cent of the time when the morning sky is red and only
20 per cent of the time when the morning sky is gray, the amateur
weather prophet has a valid law, even though not a wholly
accurate one, for predicting the weather.
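
The usefulness of such an imperfect law can be checked with a little
arithmetic. The sketch below is only an illustration: the two rates are
the ones given above, but the share of red mornings (P_RED) is an assumed
figure, and it is further assumed that every morning counts as either red
or gray. It compares the prophet's rule with the simple policy of always
predicting good weather.

```python
# Hypothetical illustration of the amateur weather prophet's law.
# The two conditional rates come from the text; the share of red
# mornings is an assumed figure chosen only for this example.

P_BAD_GIVEN_RED = 0.75   # from the text
P_BAD_GIVEN_GRAY = 0.20  # from the text
P_RED = 0.30             # assumption, not from the text


def rule_accuracy(p_red: float) -> float:
    """Share of mornings on which 'red means bad, gray means good' is right."""
    correct_on_red = p_red * P_BAD_GIVEN_RED
    correct_on_gray = (1 - p_red) * (1 - P_BAD_GIVEN_GRAY)
    return correct_on_red + correct_on_gray


def always_good_accuracy(p_red: float) -> float:
    """Share of mornings on which 'always predict good weather' is right."""
    p_bad = p_red * P_BAD_GIVEN_RED + (1 - p_red) * P_BAD_GIVEN_GRAY
    return 1 - p_bad


print(round(rule_accuracy(P_RED), 3))
print(round(always_good_accuracy(P_RED), 3))
```

With these assumed figures the rule is right more often than the blanket
prediction, which is all that a useful, though inaccurate, law requires.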


Kinds of Variables and Laws 


All laws state a relationship between a dependent variable and
one or more independent variables. A dependent variable is one 
about which we make a prediction; an independent variable is one 
we use to make the prediction. In the above examples, the length
of the pendulum and the color of the sky were independent variables,
while the period of the pendulum (the time it takes to make its
complete swing) and the weather were dependent variables. In 
a study of learning, the dependent variable is some aspect of the 
learner’s performance, while the independent variables may be 
any characteristics of the learner, the task, or the situation. 

In some cases these variables and the laws relating them to one 
another involve merely the presence or absence of something. 
This is the case with the weather-prediction law. Red in the eve- 
ning indicates good weather and red in the morning bad 
weather; that is all we are told. Does it matter whether the red is 
pale or deep? Will the bad weather be a drizzle or a hurricane? 
The law does not tell us. In other words, it deals with qualitative 
information only, information about the kinds of events that occur. 
In other cases, however, degrees of the independent variable are re- 
lated to degrees of the dependent variable. The physicist’s law tells 
us how much of a change in pendulum length will produce how 
much of a change in period. This, then, is a quantitative law, one 
that gives information about amounts of things, about the degree 
to which certain events occur. In learning, the statement that re- 
moval of a reinforcer produces extinction is a qualitative law, 
since it refers only to whether or not reinforcement is removed 
and whether or not extinction occurs. On the other hand, the state- 
ment that a larger reward results in a higher level of performance 
is quantitative, since it deals with different amounts of reward and 
different levels of performance. Both qualitative and quantita- 
tive laws are found in all branches of science, but in general the 
more highly developed sciences tend to have more quantitative laws.
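
The contrast between the two kinds of law can be made concrete in a short
sketch. The pendulum function uses the standard textbook formula for an
ideal pendulum swinging through a small arc, which is not stated in the
text above and is brought in here only as an illustration; the extinction
function is a deliberately bare yes-or-no rule. Both function names are
hypothetical.

```python
import math

G = 9.81  # assumed gravitational acceleration, in meters per second squared


def pendulum_period(length_m: float) -> float:
    """Quantitative law: period in seconds of an ideal small-swing pendulum."""
    return 2 * math.pi * math.sqrt(length_m / G)


def extinction_occurs(reinforcer_removed: bool) -> bool:
    """Qualitative law: says only whether extinction occurs, not how fast."""
    return reinforcer_removed


# The quantitative law tells us how much: four times the length
# gives twice the period, because the period follows the square root.
ratio = pendulum_period(4.0) / pendulum_period(1.0)
print(round(ratio, 6))

# The qualitative law tells us only whether.
print(extinction_occurs(True), extinction_occurs(False))
```

The first function returns a different amount for every length; the second
can return only one of two answers, which is the whole difference between
a quantitative and a qualitative law.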

Independent variables also differ in another respect. Some in- 
dependent variables can be directly varied by an experimenter. 
He can arrange the independent variable to suit himself and then 
see what happens to the dependent variable. For example, a physi- 
cist can change the length of a pendulum in order to see how this 
difference affects the rate at which the pendulum swings. Likewise, 
a psychologist can stop reinforcing a pigeon for pecking a key and 
watch what happens to the rate of pecking. Such a study, in which
the independent variable is manipulated by the researcher, is called
an experiment. If the experiment is properly controlled, we can conclude
definitely that the changes in the independent variable
caused the changes in the dependent variable. 

There are other independent variables, however, that cannot be 
manipulated by an experimenter. The weather prophet cannot 
make the sky red in order to see what will happen to the weather. 
He has to wait until the sky gets red by itself and then watch for 
changes in the weather. This is still a perfectly valid scientific 
study, but it is not an experiment, for the researcher does not ma- 
nipulate the independent variable. A similar example in learning 
would be a study of memorization rate in people of varying IQs. 
Here IQ would be the independent variable and rate of learning 
the dependent variable. The researcher could not change a per- 
son’s IQ; he could only choose subjects who already had different 
IQs and then compare their learning rates. A difficulty with this
kind of nonexperimental study is that we can seldom be sure just 
what is causing what. We would not be likely to say that red in the 
evening sky caused good weather; presumably some atmospheric 
condition caused both. The red sky was merely an indicator of good 
weather, not its cause. We might say that high IQ caused faster 
memorizing (if we found such a relation, as we quite likely would 
not), but perhaps it would be just as reasonable to say that high 
memorizing ability was responsible, over a lifetime, for the devel- 
opment of a high IQ. These nonexperimental studies still give us 
laws that are valuable for prediction, but for telling us what
causes what they are definitely inferior. For this and other reasons, 
experimental studies are preferable wherever it is possible to do
them, and the great majority of psychological studies of learning
are experiments.


As we have seen, scientific law: 
They may indicate simply that w 
the primary focus of science in general and hence of the psychology
of learning in particular.


Abstraction 

Scientific laws are statements about the way the world oper- 
ates, and like all statements they involve abstractions. Whenever 
we apply words to things and events, we ignore a great deal of 
what is there in order to focus attention on what this particu- 
lar thing or event has in common with others. For example, when 
we call something a car, we are ignoring its make, model, year, 
and color in order to emphasize the features it has in common with 
other cars. In other words, we are abstracting its “carness” from 
all the other characteristics of this particular battered, brown, 
eight-year-old, two-door Ford. The same is true when we use the 
term “discrimination” to describe the behavior both of a rat learn- 
ing whether to turn right or left in a T maze and of a child learn- 
ing whether to use “ar” or “er” in spelling a word. Again, we are 
dealing with an abstract concept, ignoring most aspects of the 
two situations in order to concentrate on one thing that they have 
in common. 

Since some degree of abstraction is inescapable, any statement, 
however concretely “factual,” is an abstract formulation that tells 
only a part of the truth. To say, “Columbus crossed the ocean blue 
in 1492,” is to give only the barest outline of that momentous 
event. Even to say, “John Doe dropped a copy of A Tale of Two 
Cities from his school desk to the floor in the middle of an arithme- 
tic lesson in Room 6 on the morning of March 17” is to give 
only a minute fraction of the detail that could have been observed. 
For all their tremendous value, words are only pale shadows of 
the things they represent. 

This process of abstraction goes on in all description and in all 
thought. Every intellectual activity involves the organization and 
simplification of “reality” as it is presented to our senses. This 
statement is true whether we consider simple perceptions or com- 
plex thoughts, and whether we look at science, art, sports, business 
or any other area of human interest. Imagine, for example, what 
the radio broadcast of a football game would sound like if it were 
presented by a society editor, or by the proud father of one of the
players, or by a foreigner studying American customs. The so- 
ciety editor might spend all her time describing the lovely uni- 
forms and discussing the family backgrounds of the players; the 
proud father might report only what his own son was doing; and 
the foreigner might be much more interested in the cheers and card 
stunts than in the progress of the game. Each of these three de- 
scriptions could be perfectly true and valid as far as it went, but 
what football fan would accept any of them as an adequate re- 
port of the game? Each description, including the professional 
sportscaster’s, would deal with only some aspects of the “real
game”; each would reflect not only what “really happened” but 
also the interests and biases and vocabulary of the reporter. 

In effect, there is no such thing as pure reality; there is only re- 
ality as described and interpreted and reacted to by someone. 
Some descriptions are more accurate or more detailed than oth- 
ers, but no description is complete. Even if someone were ambi- 
tious enough to collect descriptions of a football game from 
everyone who was there, check them with other kinds of evidence,
spend years sifting and combining these accounts, and give a final 
report of the game many volumes long, there would still be details 
that were omitted. The report would still be only an abstract, even
though a very detailed one, of what happened at the game. In any 
case, no one would read the report, since each possible reader 
would be interested in those aspects of the game that he considered 
interesting or important, not in the most detailed account that hu- 
with picking out certain aspects of the situation to be related to
other aspects. In relating the period of a pendulum to its length, 
the physicist need not concern himself with what the pendulum 
is made of or with whether it is a clock pendulum, a plumb bob, 
or a museum display. Similarly, the psychologist’s statement that 
removal of the reinforcer reduces the frequency of the response 
applies whether the reinforcer is food or praise and whether the 
response is pecking a key or studying spelling. The laws connect- 
ing independent and dependent variables are not descriptions of 
any particular event—they are statements about the conditions 
under which certain kinds of events occur. Scientific laws are not 
concerned specifically with the red sky in Chicago on June 10 or 
with the pendulum of John Jones’s grandfather’s clock, but with 
all red skies or all pendulums. Some laws may be quite narrow in 
the range of phenomena to which they refer, but even the narrow- 
est always refers to certain kinds of events, never merely to a sin- 
gle unique event. 


The Value of Scientific Laws 


Scientific laws serve two main purposes. One purpose is prac- 
tical—to provide man with the means of predicting and control- 
ling events. Simply being able to predict what will happen and 
thus take measures to deal with it is of considerable value. Such 
prediction is what keeps the weather bureau and the various in- 
vestment advisory services in business. It is even more useful, 
however, to be able to control events. An independent variable 
that we can manipulate gives us some degree of control over the 
dependent variable. This fact is the meaning of the saying that 
“knowledge is power.” Only if we have a law about the condi- 
tions under which certain kinds of events happen can we set up 
the necessary conditions for one such event to happen when we 
want it to. It is not necessary, of course, for the law to be formally 
stated; much of our practical knowledge is very casual. However, 
the more complete and accurate our formulation of the laws is, the 
better able we are to control the world around us. 

Scientific laws also have a less utilitarian value. Since earliest 
times man has sought to understand the world in which he lived. 
From the child trying to find out what makes the watch tick to
the cosmologist trying to find out what makes the universe tick,
man is constantly asking “what is it?” and “how does it work?” 
and “why?” No practical benefits are required to justify this cu- 
riosity; to gain the knowledge is benefit enough in itself. This bene- 
fit is the basis of pure science, the search for fundamental knowl- 
edge about the world. 

Both of these purposes of science are exemplified in the study 
of the psychology of learning. The laws of learning are of crucial 
importance to education, to industrial, military, and other forms 
of training, to child rearing, to psychotherapy, and to a variety 
of other practical areas of work. They are also basic to an under- 
standing of how individuals and societies come to be as they are, 
of how knowledge is obtained, indeed of how man acquires his 
unique humanness. Hence for both reasons an increase in our 
knowledge of the laws of learning is much to be desired.


THE NATURE OF THEORIES 


Researchers are rarely satisfied, however, merely to collect 
more and more laws about learning or about anything else. To 
satisfy man’s desire for understanding, knowledge must be organized.
An encyclopedia full of laws relating each of a vast number
of independent variables to each of a vast number of dependent
variables might give its owner the emotional satisfaction of
having a great deal of knowledge available, but it would not give 
the intellectual satisfaction of understanding the topics involved. 
Such satisfaction requires more general knowledge than that pro- 
vided by this imaginary encyclopedia of laws. Even for practical 
purposes, such an encyclopedia would be cumbersome. It would 
be more convenient to have general principles from which the spe- 
cific laws could be deduced. So, although the establishing of laws 
is in one sense the most basic activity of science, it is not the end
of scientific activity. Much of the researcher’s effort is dedicated
toward establishing more general principles or interpretations.
This effort takes him into the realm of scientific theory. 

We have already seen that description represents some abstraction
and organization and simplification of the events being described,
and that the statement of laws represents a higher level
of abstraction. With theory we come to a still higher level of abstraction.
It differs in degree but not in kind from the lower levels.
It is a serious mistake to think of a realm of theory that is sepa- 
rate and different from the realm of fact. When people speak of 
“facts,” they are sometimes referring to descriptions of events 
(“It is a fact that Richard M. Nixon became President in 1969.”) 
and sometimes to laws (“It is a fact that hydrogen and oxygen 
will combine to form water.”). As we have seen, both descriptions 
and laws represent organizations and simplifications of what is 
“really there” according to the language and the biases and the 
objectives of whoever is describing the event or stating the law. 
Theory exemplifies the same processes, but in still greater degree. 
It would be reasonable to say either that facts represent one kind 
of theory or that theories represent one kind of fact, but most 
reasonable to say that fact and theory represent different degrees 
of what is basically a single process. 

What is a theory? This is not an easy question to answer, partly 
because there are a number of different opinions about what a 
theory should be like and what functions it should serve. It 
is really only through studying different theories—noting their 
similarities and differences and the purposes their creators had 
in mind—that one can get a general understanding of what the- 
ories are. In a sense, therefore, the rest of this book is an attempt 
to answer this question. In this chapter we can hope to get only 
a rough and general overview. 

In the broadest sense, a theory is a systematic interpretation of 
an area of knowledge. In the psychology of learning, “system” or 
“systematic interpretation” is probably a better term than “the- 
ory,” for theory is sometimes used in a narrower sense to refer 
to a kind of formal logical system. However, in this book we will 
use theory, system, and systematic interpretation as synonyms. 


Three Functions of Theory 


A theory of learning is usually three different but closely 
related things. First, it is an approach to the area of knowledge, a 
way of analyzing and talking about and doing research on learn- 
ing. It represents the researcher’s point of view about what as- 
pects of learning are most worth studying, what independent 
variables should be manipulated and what dependent variables 
studied, what research techniques employed, and what language
used to describe the findings. It focuses his attention on certain 
topics and helps him decide which of all the possible abstractions 
will be most useful. Thus theory serves as a guide and a source of 
stimulation for research and for scientific thought.

Second, a theory of learning is an attempt to summarize a large 
amount of knowledge about the laws of learning in a fairly small 
space. In this process of summarization, some exactness and de- 
tail are likely to be lost. In such precise and well-developed sci- 
ences as physics, theories do quite well in summarizing laws so that 
the same exact predictions can be made from the theories as from 
the much more detailed laws. Psychology, to date, has been less 
successful in finding such theories. Theories of learning, in at- 
tempting to summarize large amounts of knowledge, lose a good 
deal in completeness and precision. They are simplifications or 
skeletal outlines of the material with which they deal. As such, 
they represent a gain in breadth, in organization, and in simplicity, 
but also a loss in accuracy of detail. 

Third, a theory of learning is a creative attempt to explain what 
learning is and why it works as it does. The laws give us the “how” 
of learning; the theories attempt to give us the “why.” Thus they 
seek to provide that basic understanding which is one of the goals, 
not alone of science, but of all forms of scholarship. Theories
represent man’s best efforts to determine the underlying structure
of the world in which he lives.


Intervening Variables 


In most cases, theorists have sought this underlying structure 
in entities that were not visible to the observer. Theorists in the 
field of chemistry, for example, assumed the existence of mole- 
cules long before anyone had ever seen a molecule under the mi- 
croscope. They did so because the laws of chemistry formed a 
simpler and more logical pattern if all substances were assumed to 
be made up of molecules. The laws of chemistry did not themselves
deal with molecules, but with substances that could be seen
and touched and weighed. The molecules were in effect invented
by the theorists as an explanation for the laws. This invention was
a creative guess that has received more and more support from
later evidence and that has contributed immensely to the devel- 
opment of chemistry. 

Let us consider a corresponding example, though admittedly 
a less striking one, from the psychology of learning. We might de- 
prive an animal of water for a period of time, or reduce the ani- 
mal’s daily ration of water below the normal intake, or permit 
him to drink only for a limited time each day, or feed him a lot 
of dry food without water available, or inject a salt solution into 
his stomach. The extent to which we did any one of these things 
(e.g., how long he could drink per day or how much salt was in- 
jected) would be an independent variable. We would find that 
each of these independent variables was related in much the same 
way to each of several dependent variables. As any of these inde- 
pendent variables increased, the animal would become more 
active, would drink more water when water was available, would 
run faster to a place where he had found water in the past, and 
would be more reinforced by water if water was presented as a 
reward for making some response. These relationships constitute 
twenty laws, relating each of the five independent variables to 
each of the four dependent variables. We can, however, reduce 
this number to nine laws by saying that each of the five independ- 
ent variables produces a state of thirst and that this thirst in turn 
produces changes in the four dependent variables. By hypothe- 
sizing this state of thirst, we have more than cut in half the num- 
ber of laws required to describe the relationships involved. 
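
The saving can be counted directly. The following sketch, offered only as
an illustration, lists the five manipulations and four measures described
above (the short labels are paraphrases chosen for the example) and counts
the laws needed with and without the hypothesized state of thirst.

```python
from itertools import product

# Short labels paraphrasing the variables in the text (labels are illustrative).
independent = [
    "hours without water",
    "reduced daily ration",
    "limited drinking time",
    "dry food without water",
    "salt solution injected",
]
dependent = [
    "general activity",
    "amount of water drunk",
    "running speed to water",
    "reinforcing effect of water",
]

# Without an intervening variable, every independent variable needs
# its own law for every dependent variable.
direct_laws = list(product(independent, dependent))

# With thirst intervening, each independent variable needs one law
# linking it to thirst, and thirst needs one law per dependent variable.
laws_via_thirst = [(iv, "thirst") for iv in independent] + [
    ("thirst", dv) for dv in dependent
]

print(len(direct_laws), len(laws_via_thirst))
```

In general, m independent and n dependent variables require m times n
direct laws but only m plus n laws once a single intervening variable is
assumed, so the saving grows as more variables are brought under the same
concept.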

So far this act of theoretical simplification is no more than what 
any layman does. It is so commonplace, indeed, that it is easy to 
overlook its importance. No one has ever seen or touched or 
weighed thirst. We have felt it in ourselves, but in anyone else we 
can only infer it. We have to infer someone else’s thirst from what 
has happened to him (the independent variables) or from what he 
does (the dependent variables). If he tells us he is thirsty, his 
statement is only one of the possible dependent variables, and not 
necessarily the most reliable. When a child gets out of bed for the 
fourth time and tells his mother he is thirsty, she is understand- 
ably more likely to trust other evidence than what the child says. 
Anyone who thinks it is possible to differentiate sharply between
theory and fact should consider how much theory there is in the
simple “factual” statement, “He is thirsty.”

The psychologist’s use of the concept “thirst” differs from the
layman’s in two respects. One is in precision. The psychologist is 
not satisfied with saying that certain manipulations will produce 
thirst and thirst will produce certain behaviors; he goes on to de- 
termine what degrees of the independent variables are related to 
what degrees of thirst and what degrees of thirst are related to what 
increases in the dependent variables. Thus the psychologist’s use 
of the concept “thirst” allows for more complete and accurate 
detail than the layman’s. 

The other difference is that the psychologist is likely to go far- 
ther than the layman in relating this concept to others. Many the- 
orists of learning have been struck with the similarities among 
hunger, thirst, pain, and a variety of other such hypothetical states 
(hypothetical because none of them can be directly observed). 
All of these tend to produce increases in activity and physiologi- 
cal changes characteristic of stress. Moreover, the termination of 
any of these is reinforcing. They have therefore often been classi- 
fied together as drives. This concept of drive provides a higher 
level of theoretical integration, bringing together far more laws 
than the separate concepts of hunger, thirst, pain, and the like. 

Theoretical concepts of the sort we have been discussing are 
often called intervening variables. This name reflects their place 
in the theoretical structure, intervening between the independent 
and the dependent variables and forming a link connecting them 
together. These intervening variables are states or conditions of 
the individual that are inferred from observations. Habits, beliefs,
and motives are examples of intervening variables that are important
in various theories of learning.


t er is very likely asking, “Are these inter- 
vening variables really there, waiti 
matter. Undoubtedly both discovery and creation enter into theory,
as indeed into all scientific work. A theory must lead to valid
predictions and must be consistent with well-established laws; otherwise
it is worthless. This requirement sets limits on the theorist’s
creative freedom. On the other hand, the same laws can be inter- 
preted in different ways, and these interpretations are not waiting 
around to be discovered—they must be created by an interpreter. 
Perhaps the theorist is less like either an explorer or an artist than 
like an architect, limited by his materials and by the demands of 
his job, but still working with originality and imagination to pro- 
duce a new, useful, and beautiful structure. 

This is admittedly not a very adequate answer to the ques- 
tion of whether the intervening variables are “really there.” Some 
theorists talk as though they are really there and others as though 
it is merely convenient to pretend that they are there. It has been 
suggested that there should be two different terms, one for those 
that the theorist thinks he has discovered and the other for those 
that he thinks he has invented. Fortunately, we can leave this is- 
sue to the philosophers and concern ourselves only with the part 
that intervening variables play in the various theories we will be 
studying. 

In any case, the element of creativity in theory construction ex- 
plains why there are many theories of learning. Each theorist 
tries to find a way of structuring reality that will be useful 
and meaningful to him. The differences among the resulting the- 
ories reflect partly the differences in the topics that various theo- 
rists find most interesting to work with, and partly differences in 
the kinds of systematic structures that different theorists consider 
worth producing. All, however, reflect the efforts of thoughtful 
men to interpret the phenomena of learning in coherent and intellectually
satisfying ways.


Kinds of Learning Theories 


The currently important theories of learning can be classified 
in a number of ways. For our purposes, two main differences 
among these theories seem most significant. One of these is the dif- 
ference between the connectionist and the cognitive theories. 
Connectionist interpretations of learning, however much they may 
differ among themselves, agree in treating learning as a matter of
connections between stimuli and responses. (A response may be 
any item of behavior, while a stimulus may be any sensation. Con- 
nectionist theorists typically assume that all responses are elicited 
by stimuli.) These connections are called by a variety of names, 
such as habits, stimulus-response bonds, and conditioned re- 
sponses. Always, however, there is a concentration on the re- 
sponses that occur, on the stimuli (and perhaps other conditions) 
that elicit them, and on the ways that experience changes these re- 
lationships between stimuli and responses. Cognitive interpreta- 
tions, on the other hand, are concerned with the cognitions (percep- 
tions or attitudes or beliefs) that the individual has about his en- 
vironment, and with the ways these cognitions determine his be- 
havior. In these interpretations, learning is the study of the ways 
in which cognitions are modified by experience. 

Common sense makes use of both kinds of interpretations. 
When discussing simple reactions or more complex physical 
skills, we are likely to say, “I guess it’s just a bad habit I’ve 
learned,” or “With all that practice, his reactions have become 
very fast and smooth.” These are connectionist interpretations. 
On the other hand, when discussing matters that involve words 
or deliberate decisions, we often say things like, “He has ac- 
quired a lot of knowledge on that subject,” or “You'll have to 
learn that people don’t like to be treated that way,” or “Now I
really understand geometry!” These interpretations are all cognitive.


Whether a given psychologist will prefer a connectionist or a
cognitive theory of learning depends partly on the kind of learning
in which he is most interested. A specialist in the study of conditioning
is likely to find a connectionist interpretation better suited
to his needs, while a specialist in problem solving may find a cognitive
interpretation more useful. However, each specialist will
tend to believe that the theory he prefers is best, not only for his
own field of study, but for the whole psychology of learning. This
tendency reflects a desire for unity and simplicity that is one of
the reasons why theories are developed in the first place. As a result,
some people adopt general cognitive theories of learning
and others adopt general connectionist ones.

In addition, philosophical differences bias people toward different
interpretations. Connectionist theories lend themselves to
greater precision and fit in better with an over-all scientific ap- 
proach in which human learning is just one part of the natural 
world. Cognitive theories, on the other hand, make more allow- 
ance for the power and flexibility of man’s intellectual processes 
and the ways in which he deals with complex problems. As a re- 
sult, scientific psychologists have more often chosen connection- 
ist interpretations, while applied psychologists (including those in 
education) have been more inclined toward cognitive interpreta- 
tions. This is by no means a hard and fast distinction, however, 
and there are signs that it may be breaking down further. 

The other major difference is between formal and informal 
theories of learning. An informal theorist gives an interpretation 
of learning in words. It is a more systematic, more precise, and 
more thorough interpretation than the proverbial man in the 
street would give, but it is fundamentally of the same sort. It is an 
attempt to explain in everyday language what learning is, how it 
operates, and what is going on in the person when it occurs. A 
formal theorist, on the other hand, tries to make his theory a for- 
mal, logical structure. As compared with the informal theorist, 
he is likely to make greater use of mathematics and to distinguish 
his intervening variables more rigorously from his independent 
and dependent variables. His goal is to create a system of postu- 
lates and theorems somewhat like Euclid’s geometry, in which a 
great many laws can be logically deduced from a small number 
of postulates. This formal procedure has the advantage of making
clearer just what the theorist is doing. It makes the logic of theory
construction more explicit and also makes sloppy thinking
easier to detect. However, it also has the disadvantage of making the 
theoretical writings harder to read and farther removed from 
everyday ideas of what an interpretation of learning should be. 
In the history of learning theory, formal systems have tended to 
develop later, building on the informal systems that preceded 
them. 

Needless to say, neither the connectionist-cognitive distinction 
nor the formal-informal distinction is an all-or-nothing matter. 
There are various middle positions between the cognitive and the 
connectionist, and there are varying degrees of formality. Nevertheless,
these two distinctions form a convenient basis for classifying
the interpretations of learning that we will be investigating
here. We will look first at some informal connectionist theories, 
which will illustrate the ways psychologists typically interpret 
learning. Next we will consider some cognitive theories, varying 
in formality. These are less popular among psychologists of 
learning, but have been influential in social psychology and in ed- 
ucation and have important implications for the future develop- 
ment of learning theory. Finally we will look at some formal the- 
ories primarily of the connectionist sort, and at some compro- 
mise positions. It is hoped that this survey will indicate what 
learning theories are, what they try to accomplish, how successful
they are, and what they can contribute to our understanding of the
learning process.


CONTIGUITY THEORIES 
IN THE CONNECTIONIST 
TRADITION 


MAN’S INTEREST IN psychology has a long history. At least since
the time of the ancient Greeks, philosophers have been speculat- 
ing about topics that are now considered part of psychology. How 
do we think and feel and learn and know and make decisions and 
act on them? Attempts to answer these questions make up a considerable
part of the history of philosophy. It was not until the
nineteenth century, however, that attempts were made to study
these topics experimentally. The first psychological laboratory 
was founded by Wilhelm Wundt in Germany at a time traditionally 
given as 1879. Although research in psychology had been going on 
before then, this date marks the point at which modern scientific 
psychology was placed on a definite institutional footing. 

Wundt and his colleagues in early scientific psychology, like the 
philosophers from whom they drew much of their inspiration, 
were largely interested in conscious experience. They wanted to 
understand man’s sensations and thoughts and feelings. They 
wanted to take the continuous flux of conscious awareness and 
analyze it into its basic components. Are memory images the 
same as sensations? Are feelings a special kind of sensation or are 
they something radically different? How is the intensity of a sensation
related to the intensity of the physical stimulus that produces
it? These were the sorts of questions that the early experimental
psychologists studied.

This kind of psychology, developed in Germany, became to a
great extent the standard for the rest of Europe and for America.
The research in psychological laboratories and the discussion in
textbooks of psychology were principally based on this approach.
Its acceptance, however, was never complete. In the United 
States, for example, there was always a considerable trend toward 
the study of objective behavior as well as of conscious experi- 
ence. American psychologists were interested in what people did 
as well as in what they thought and felt. Along with this went an 
un-European interest in applied psychology, in the practical uses
to which psychological knowledge could be put. So although
American psychology developed largely within the German mold,
it had its own individual features as well.


WATSON AND BEHAVIORISM 


Early in the twentieth century, these features of American psychology
came more and more into conflict with the German tradition.
Pressure increased to break the traditional mold and to develop
a psychology that was frankly oriented toward objective
behavior and toward practical usefulness. To varying degrees,
psychologists were taking up the cry, “Enough of studying what
people think and feel; let’s begin studying what people do!” This
movement found its most vocal spokesman in John B. Watson
(1878-1958). It was through his vigorous attacks on traditional
psychology and his attempts to build a radically different system
that American theoretical psychology came into its own.

In 1903 Watson received the first doctorate of philosophy in [...]
early interests. He was im[...] animal behavior it is possible [...]
simply study what the animal [...] the same with humans? Behavior [...]
while consciousness belongs to [...] consciousness from our discus[...]
Watson’s professors at Chicago agreed [...] traditional psychology, but
considered his solution too radical. Perhaps they thought that, like
so many young rebels, he would become more conservative with
age and responsibility. In Watson’s case, however, this change
did not occur. After joining the faculty at the Johns Hopkins
University in 1908, he became all the more convinced that his
position was the answer to psychology’s problems. In 1913 he
published the first formal statement of his position, an article
entitled Psychology as the Behaviorist Views It, and the
psychological revolution known as behaviorism was under way.

The reason for the name “behaviorism” is clear enough. Wat[...] for
example, has [...]iliar motto from [...] But we must [...]an the study
of conscious[...] considerable measure [...] a matter of conditioned
reflexes, that is, of responses learned by what is now called
classical conditioning. We do not show sociability or aggression
because we are born with an instinct to do so, but because we have
learned to do so through conditioning.

Watson’s demolition of theories extended not only to instincts 
but to other supposedly innate mental characteristics of man. 
He denied that we are born with any particular mental abilities 
or traits or predispositions. All we inherit is our bodies and a few 
reflexes; differences in ability and in personality are simply dif- 
ferences in learned behavior. Thus Watson was in several respects 
a strong exponent of environment as against heredity in the fa- 
miliar nature-nurture controversy. What we are depends entirely 
(except for clearly anatomical differences) on what we have 
learned. And since what has been learned can be unlearned, this 
[...]e, either in general or in a particular person, was greatly
subject to change. There was practi[...]y conditioned, might become.
[...]ty with faith in the power of [...]logy and captured the popular
[...]me more specific ideas about [...]ns for child rearing, education,
[...]on. When we consider how well [...] American belief in equality of
op[...]nal practicality, and faith in prog[...]viorism came to occupy
the center of the American psychological stage.

[...]re Watson as suddenly presenting [...] promptly and universally
adopted. [...] Psychology as the Behaviorist Views It [...] to
environmentalism were already [...] instinct doctrine was more the work
[...] whom any discussion of the change is likely to center. He gave
behaviorism its name, its loudest voice, and its sense of mission.


Watson’s Interpretation of Learning 


What, then, was Watson’s interpretation of learning? We have 
already seen that he regarded all learning as classical condition- 
ing. We are born with certain stimulus-response connections 
called reflexes. Examples are sneezing in response to an irritation 
of the nose and the knee-jerk response to a sharp tap on the knee. 
These reflexes, according to Watson, are the entire behavioral 
repertoire that we inherit. However, we can build a multiplicity
of new stimulus-response connections by the process of condition- 
ing. If a new stimulus occurs along with the stimulus for the re- 
flex response, after several such pairings the new stimulus alone 
will produce the response. This conditioning process, first de- 
scribed by Pavlov, makes it possible for each response in the origi- 
nal repertoire of reflexes to be elicited by a great variety of new 
stimuli in addition to the ones that originally elicited it. This, according
to Watson, is how we learn to respond to new situations.

Such conditioning, however, is only part of the learning process.
We must not only learn to respond to new situations, we must also
learn new responses. Sneezes, knee jerks, and the like would not
carry us very far in dealing with complex situations. How can complex
new habits be learned? The answer, according to Watson,
is by building up series of reflexes. Walking, for example, is a sequence
of many responses, such as putting the weight on one foot,
swinging the other foot forward, bringing it down, thrusting the
weight forward from one foot to the other, and so on. These responses
in proper order constitute the skilled performance
of walking. The building up of such a sequence is possible
because each response produces muscular sensations that
become stimuli for the next response. Thus new and complex behavior
is acquired through the serial combination of simple reflexes.


Let us look at this kind of learning in more detail. Consider one
[...] swinging the leg forward. Originally [...]
place toward which one is walking. However, the person can
swing his leg forward only if his weight is on the other foot. 
Hence, whenever he swings his leg forward, he does so in the 
presence of those sensations from his own body that result from 
having his weight on the other foot. Those sensations thus are 
paired with the response of swinging the leg, and through re- 
peated pairings they come to elicit leg swinging. Hence, in 
the well-learned habit of walking, the sensation of having weight 
on one foot automatically elicits the conditioned response of 
swinging the other leg forward. This response merges with the oth- 
ers in the sequence, each providing the stimulus for the next re- 
sponse. The sequence eventually becomes so well integrated that 
for practical purposes we can speak of the whole process of walk- 
ing from one place to another as a single response, even though 
it is actually a complex sequence of stimulus-response connections. 

A reader of the above discussion is very likely dissatisfied with 
Watson’s explanation of complex learning. What, one may fairly
ask, determines that this particular sequence of stimulus-response 
connections will be formed? Why does the sensation of weight 
on one foot elicit the response of swinging the other leg forward? 
Reading between the lines in Watson’s last book (Behaviorism, 
first published in 1924), one gets the feeling that he was still 
seeking the answer to this problem. He had two different answers, 
neither completely adequate, and the relation between the two 
was still unclear. One answer was to say that the stimulus-response
connections that make up the skilled act are conditioned
reflexes. Each response produces sensations that become conditioned
stimuli for the next response, and thus the whole sequence
of conditioned stimulus-response connections is formed. This formulation
gave Watson the satisfaction of having reduced complex
habits to their simple building blocks, conditioned reflexes. Now, 
he said, we can turn to the physiologists to explain why condi- 
tioning occurs; we as behaviorists have done our job. However, 
Watson never really carried the analysis through in detail. If feel- 
ing one’s weight on the left foot is the conditioned stimulus and 
swinging the right leg forward is the response, what is the uncon- 
ditioned stimulus which always guarantees that the response will 
occur so that conditioning can take place? To this crucial question,
Watson had no answer. As a result, his reduction of complex
behavior patterns to sequences of conditioned reflexes is more 
apparent than real. 

Watson’s other explanation of this form of learning is in terms 
of two principles: frequency and recency. The principle of fre- 
quency states that the more frequently we have made a given re- 
sponse to a given stimulus, the more likely we are to make that re- 
sponse to that stimulus again. Similarly, the principle of recency 
states that the more recently we have made a given response to a 
given stimulus, the more likely we are to make it again. 

Watson illustrates these principles with the example of a three- 
year-old child learning to open a puzzle box with candy inside. 
The child turns the box around, pounds it on the floor, and makes 
a variety of other useless responses. Finally, by chance, he presses 
a button on the box, which is the one response that will release the 
lid so he can open the box and obtain the candy. Since the box is 
now open and the candy obtained, the child is no longer in the
presence of the stimuli that kept him working at the box. The last
response he made in the presence of those stimuli was the response
of pressing the button. The next time his father puts candy in the
box and closes the lid, the child will go through much the same
sequence of trial and error as before. However, by chance, he will
try some new responses and leave out some of those from
last time. Again, however, the last response he makes will be
that of pressing the button, since that is the one which changes the
stimulus situation. Every time he works with the box, pressing
the button occurs, whereas other responses may or may not.
Thus in the long run button pushing gains a lead in frequency.
Since it is always the last response, it always has a lead in recency.
As a result, button pushing occurs sooner and sooner on successive
experiences with the box. Since button pushing solves the
problem, other responses have less and less chance to occur on
successive experiences with the box. Button pushing as a response
to the stimulus of the closed box has been learned.

This illustration shows only how a single response, pressing a
button, is learned. The problem might, however, have required the
child to make a series of several successive responses to open the
box, with each response changing the situation so that the next
response could be made. Pressing the button, for example, might
have opened an outer lid and revealed a lever that had to 
be moved sideways in order to open the inner lid. In this case both 
button pushing and lever moving would be learned in the same 
way, since each would change the stimulus and thus become the 
last response to the old stimulus. Such a series of responses could 
be extended indefinitely. 

Why, then, does one particular response rather than others oc- 
cur to the stimulus at a given place in a complex sequence? Wat- 
son’s answer is that during learning many different responses oc- 
cur to the stimulus, but that through the process described above, 
most of them drop out. The response that changes the situation 
gains in frequency and recency until it comes to occur as soon 
as the stimulus is presented. That particular stimulus-response unit 
in the sequence is then complete. 
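
Watson’s two principles can be made concrete with a small simulation. The sketch below is only an illustration under stated assumptions, not anything Watson himself proposed: the list of responses, the rule that a response’s strength is one plus the number of trials on which it has occurred, the size of the recency bonus, and the names run_trials and recency_bonus are all hypothetical. Frequency is counted at most once per trial, following the statement that on every trial pressing the button occurs, whereas other responses may or may not.

```python
import random

# Hypothetical response repertoire for the puzzle-box example.
RESPONSES = ["turn box", "pound box", "shake box", "press button"]
SOLUTION = "press button"


def run_trials(n_trials=40, recency_bonus=2.0, seed=0):
    """Simulate one child working at the box on n_trials occasions.

    Returns (attempts_per_trial, frequency), where frequency[r] is the
    number of trials on which response r occurred at least once.
    """
    rng = random.Random(seed)
    frequency = {r: 0 for r in RESPONSES}
    last_response = None  # last response made on the previous trial
    attempts_per_trial = []

    for _ in range(n_trials):
        # Assumed strength rule: baseline 1, plus frequency, plus a bonus
        # for the response that ended the previous trial (recency).
        weights = [
            1.0 + frequency[r] + (recency_bonus if r == last_response else 0.0)
            for r in RESPONSES
        ]
        made = set()
        attempts = 0
        while True:
            response = rng.choices(RESPONSES, weights=weights)[0]
            attempts += 1
            made.add(response)
            if response == SOLUTION:
                break  # the box opens and the stimulus situation changes
        for r in made:
            frequency[r] += 1
        last_response = response
        attempts_per_trial.append(attempts)

    return attempts_per_trial, frequency
```

Calling run_trials with several different seeds and averaging the attempts per trial gives a rough learning curve. Counting every repetition of a response within a trial, rather than one occurrence per trial, would be a different assumption and could favor a wrong response.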

All of these statements about the learning of new responses are 
left rather undeveloped in Watson’s treatment. How are conditioning,
the principle of frequency, and the principle of recency
related? How does the fact that the learner may at first make some 
wrong response much oftener than the right one, yet still eventu- 
ally learn the right one, fit in with the principle of frequency? Wat- 
son does not tell us. He was confident that complex learning could 
be explained by simple principles, but his attempted explanations
were tentative and were never organized into a clear and consistent
theory.


Special Kinds of Learning 


What about the learning of emotional reactions? Here Watson 
makes a small concession to heredity, since he recognizes three in- 
nate patterns of emotional reaction. In principle these reaction 
patterns are the same as reflexes, for we can state what move- 
ments (including those of the internal organs) they involve and 
what stimuli will produce them. However, they are more compli- 
cated than what is usually meant by a reflex. The three emotional 
reaction patterns may for convenience be labeled as fear, rage, 
and love. We must note, however, says Watson, that these labels 
refer to patterns of movement, not to conscious feelings. If 
we bang a gong close to a child, and he starts to cry, we may describe
this event by saying that the stimulus of a loud noise has
produced the emotion of fear. However, we are simply giving a
name to the behavior we see, not commenting on the child’s feelings.

Emotional learning involves the conditioning of these three
patterns of emotional response to new stimuli. The above example 
of innate fear was taken from a famous experiment of Watson's, 
which can also be used to illustrate conditioned fear. Little Albert, 
aged 11 months, was permitted to play with a white rat, which he 
did happily and with no sign of fear. A metal bar was then hit 
with a hammer close behind him. He started and fell sideways. 
This sudden loud noise was repeated a number of times just as 
the rat was presented to him, and each time he reacted in the same 
way, sometimes also whimpering. These responses indicate that the 
noise was an unconditioned stimulus for fear. After this training, 
the rat was presented without the noise. Albert fell over, cried, and 
crawled away from the rat as fast as he could. This change indi- 
cates that through the training procedure the rat had become a 
conditioned stimulus for fear. According to Watson, such condi- 
tioned and unconditioned responses account for all of our emo- 
tions. 

What about the acquisition of knowledge? Can conditioning 
be used to explain how one learns, for example, the facts of his- 
tory? Certainly, says Watson, for this knowledge consists simply 
of saying certain words, aloud or to oneself. The response se- 
quence involved in saying “William the Conqueror defeated Harold 
the Saxon at Hastings in 1066” is in principle no different from 
that involved in walking across a room. A question, such as 
“How did the Norman conquest occur?” elicits the statement, 
which is itself a sequence of words with each word a conditioned 
stimulus for the next one. Acquiring knowledge is a process of 
learning to give the proper sequence of words in response to a 
question or other conditioned stimulus. 

All of our behavior, says Watson, tends to involve the whole 
body. When we think, we may pace the floor or furrow our brows. 
We announce our opinions with smiles or waves of the arm as well 
as with words. We cannot therefore really say that emotions are 
responses of the gut or that thinking is made up of vocal responses. 
These are the dominant but by no means the only responses involved.
Everything we think, feel, say, or do involves activity of,
to varying degrees, the entire body. This is probably the most fun- 
damental credo of behaviorism. 


Evaluation of Watson 


Watson’s great contribution to the development of psychology 
was his rejection of the distinction between body and mind and 
his emphasis on the study of objective behavior. This battle was 
so effectively won that most of the learning theory in America to- 
day is, in the broader sense of the word, behaviorist. In this book 
we will see a number of theoretical systems that represent differ- 
ent variations on the behaviorist theme. All of them have in com- 
mon a concern with objective behavior, a strong interest in ani- 
mal studies, a preference for stimulus-response analysis, and a 
concentration on learning as the central topic in psychology. This 
fact makes Watson in some ways the intellectual father or grand- 
father of a large portion of the systems we will be considering here. 

On the other hand, Watson was much less thorough than he 
might have been in dealing with the detailed problems of learn- 
ing. We have already seen the incompleteness and inconsistency 
in his treatment of complex learning. 

In his eagerness to build an objective psychology, he was some- 
what cavalier with the matter of logical thoroughness. Perhaps 
if he had worked on his theory longer (after the 1920's he stopped 
publishing and devoted himself exclusively to applied psychol- 
ogy) he would have extended his system to deal with some of 
these problems. More likely, however, his zeal for freeing psy- 
chology from subjectivism and nativism was incompatible with la- 
borious theoretical completeness. In any case, Watson is now ad- 
mired mainly for his philosophical trail blazing rather than for his 
detailed system building. It has remained for others to try to 
build, within the behaviorist framework, a more complete theory 
of learning. 


GUTHRIE’S INTERPRETATION OF LEARNING 


Of those who have continued in the behaviorist tradition in re- 
cent years, the one who has remained closest to Watson’s original 
position is Edwin R. Guthrie (1886-1959). From 1914 until his
retirement in 1956, Guthrie was a professor at the University of
Washington. His university teaching career thus began only ten 
years later than Watson’s, and he himself never studied with Wat- 
son. His graduate education was more in philosophy than in psy- 
chology. Nevertheless, his interpretation of learning sounds much 
like what Watson’s might have been if he had had another decade 
to work on the topic. His definitive work, The Psychology of 
Learning, was published in 1935 and revised in 1952, and his 
final theoretical statement was published in 1959. Thus Guthrie 
can be regarded as a much more nearly contemporary theorist than 
can Watson. 

Among theories of learning, Guthrie’s is one of the easiest to 
read in his own words, but nonetheless hard for someone else 
to discuss. It is easy to read because he wrote in an easy, informal 
style, making his points with homely anecdotes rather than with 
technical terms and mathematical equations. It is hard to write 
about because his casual presentation contains the germ of 
a highly technical, deductive theory of learning. Reading Guthrie 
is like reading an exciting novel that contains a difficult allegory, 
so that it can be read on an easy or a hard level. At the heart of 
his system is one basic principle of learning. Interpreted loosely, 
this principle is a source both of entertaining interpretations of 
learning and of valuable advice about the management of 
learning situations. Interpreted rigorously, it becomes the chief 
postulate of a deductive theory. This theory, so deceptively simple 
at first glance but so maddeningly complex on closer investigation,
stands as a challenge to students of learning. Has Guthrie actually
succeeded in summing up the whole field of learning in one
key postulate?


The Basic Principle of Learning 

Guthrie’s basic principle of learning is similar to the condition- 
ing principle that was basic for Watson, but it is stated in a still 
more general form. It says, “A combination of stimuli which has
accompanied a movement will on its recurrence tend to be fol- 
lowed by that movement.” (Guthrie, 1952, p. 23) The principle 
may be paraphrased as, “If you do something in a given situation, 
the next time you are in that situation you will tend to do the same
thing again.” This principle is more general than the principle of 
classical conditioning, in that it says nothing about an uncondi- 
tioned stimulus. It says only that if a response accompanies a given 
stimulus once, it is likely to follow that stimulus again. In classi- 
cal conditioning, the response occurs with the (conditioned) stim- 
ulus during training because the unconditioned stimulus elicits 
it. This sequence of course fulfills Guthrie’s conditions for learn- 
ing. However, it does not matter to Guthrie whether the response 
is elicited during training by an unconditioned stimulus or in some 
other way. As long as the (conditioned) stimulus and the response 
occur together, learning will occur. 

In claiming to sum up the whole field of learning in that one 
statement, Guthrie was inevitably challenging others to find in- 
adequacies in the summary, and psychologists have been quick 
to answer the challenge. The first difficulty with this principle is 
that one often does many different things in the same situation. 
Which one will occur next time? This challenge is no problem for 
Guthrie; he simply replies, “The last one.” A person struggling with 
a mechanical puzzle tries many responses. If he finally makes the 
correct response, he will tend to make this same response when 
next confronted with the puzzle. We say, then, that he has learned 
how to do the puzzle. Suppose, however, that he finally gives up and 
puts the puzzle aside unsolved. The next time he sees the puzzle 
he will tend to do what he did last, namely put it aside. In that 
case we do not say that he has learned how to do the puzzle, but 
he has still learned something. In both cases he was presented with 
a combination of stimuli from the puzzle. In each case there was 
a movement that removed the stimuli. To the observer, one of 
these movements represented success and the other failure, but to
Guthrie they both represented responses that removed the stimuli
of the unsolved puzzle and that therefore became more likely
to occur again. In both cases a response was learned, and in both
cases it was by the same principle, contiguous conditioning.

This aspect of Guthrie’s system sounds much like Watson’s 
principle of recency, since the last thing that occurred in a sit- 
uation is the one that will occur again. However, Guthrie does not 
use Watson’s other principle, frequency. Whereas for Watson a 
stimulus-response connection is something that varies in strength
and grows stronger with practice, for Guthrie it is an all-or-nothing
bond. The connection is either present or absent, with no intermediate
variation in strength. Hence the conditioning of a
movement to a combination of stimuli takes place completely in
one experience, and further practice adds nothing to the strength
of the connection.

At first glance this assumption seems contrary to well-known 
laws of learning. While practice may not make perfect, it usually
does produce gradual improvement. How can Guthrie say that all 
the improvement takes place in a single experience? We must be- 
ware, replies Guthrie, of treating a “movement” as the same thing 
as an act or an accomplishment. Guthrie is referring in his prin- 
ciple of learning to specific small movements of particular mus- 
cles. It takes many such movements working together to make up 
a skilled act. Moreover, competent performance involves not just 
one but many skilled acts, each in response to a particular combination
of stimuli. Hence learning how to do something involves
learning an enormous number of specific stimulus-movement
connections. Improvement in the skill is gradual even though the
learning of each minute part occurs suddenly. 
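
How many sudden learnings can add up to a gradual curve may be easier to see in a small simulation. The sketch below is an illustration only: the number of connections, the number of stimulus combinations met per trial, and the random sampling are all assumptions made for the example, not quantities taken from Guthrie.

```python
import random


def practice_curve(n_connections=200, per_trial=10, n_trials=60, seed=0):
    """Fraction of stimulus-movement connections learned after each trial.

    Each connection is all-or-nothing: once its stimulus combination has
    been met on a trial, it is fully learned and stays learned.
    All parameter values are illustrative assumptions.
    """
    rng = random.Random(seed)
    learned = set()
    curve = []
    for _ in range(n_trials):
        # Each trial happens to present a random handful of stimulus combinations.
        met = rng.sample(range(n_connections), per_trial)
        learned.update(met)  # one experience is enough for each connection
        curve.append(len(learned) / n_connections)
    return curve


curve = practice_curve()
for trial in (1, 5, 20, 60):
    print(trial, round(curve[trial - 1], 2))
```

Under these assumptions the proportion of the skill that has been learned rises steadily across trials, although no single connection is ever partly learned.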

Consider a particular skill, such as riding a bicycle. For each 
possible position of the bicycle, a different motion is required in
order to keep it upright. Each of these motions, in turn, is made
up of movements of the arms, torso, and legs. A particular movement
of the left arm to help correct a particular kind and degree
of tilt may be learned in one experience, but it certainly does not
follow that the whole skill of balancing the bicycle will be learned
so quickly. If we also consider all the other aspects of bicycle riding,
the distinction between learning a movement and gradually
mastering a skill becomes evident. This illustration does not show
that Guthrie is necessarily correct when he says that a movement
is learned in one trial, but it does make this interpretation more
plausible.
many specific movements. Why does Guthrie treat them as if
they were single movements which could be conditioned in one 
trial? Presumably what is needed here is an analysis in terms of 
hierarchies of complexity. Lighting a cigarette is a skill made 
up of many stimulus-movement connections that must be con- 
ditioned. However, once learned, this whole act behaves like a 
single movement and can be conditioned as such to combinations 
of stimuli. Guthrie does not concern himself with this relationship,
but applies his principle of learning sometimes to movements
and sometimes to acts, depending on the point he wants to
make. Fortunately, this ambiguity is unimportant in most situations.


Guthrie's Substitute for Reinforcement 


The aspect of Guthrie's theory that has been most attacked is 
his lack of concern with success and failure, with learning to do 
the “right” thing. Whatever one did last in the situation, right or 
wrong, is what one will do again. Guthrie makes no use of the con- 
cept of reinforcement. He does not say that we learn to make those 
responses that work, or that obtain reward. Whether or not something
we do becomes learned as a response to the situation depends
only on whether it changes the situation into a different
situation, so that it becomes the last thing done in the old situation.
Success has this result, since a solution changes a problem
situation into a situation without a problem. Thus the successful 
act is the last one that occurs in the problem situation, and it will 
tend to occur if the problem is presented again. However, if the 
individual can somehow escape from the situation without solv- 
ing the problem, the escape response will be learned. Inefficient 
methods may be learned and retained as well as efficient ones, 
since both get the person out of the situation. Mistakes may be
repeated over and over again. We learn, not by success or by reinforcement,
but simply by doing.
the mouth. Thus pressing the lever becomes the last thing the animal
did in the old situation, and it becomes increasingly likely to occur. 
Suppose instead of getting food after each pressing of the lever the 
rat was simply taken out of the box as soon as it pressed the lever. 
This would change the situation even more than food did, so the 
rat should be even more likely to press the lever on its next opportunity
than if it had been fed. This experiment has been
done, and the results did not confirm the prediction. (Seward, 
1942) The rats receiving food showed far more lever pressing 
than those removed from the box, contrary to what would be ex- 
pected from Guthrie’s theory. This experiment and others like it 
cast doubt on Guthrie’s view that reward has nothing to do with 
learning. 

It may be, however, that our interpretation of this experiment is 
not altogether fair to Guthrie. Although the food produced less 
change in the total stimulus combination than did removal from
the box, the food did produce a marked change in certain par- 
ticularly important stimuli. These were the maintaining stimuli— 
those stimuli that kept the rat active in the situation. In this par- 
ticular case the maintaining stimuli were those resulting from 
food deprivation—in other words, the stimuli of hunger. In other 
situations they might be the stimuli of thirst or pain or sexual arou- 
sal or anger or fear. At some places in his writings Guthrie sug- 
gests that it is changes in the maintaining stimuli that are crucial 
for learning. If a response removes the maintaining stimuli, by def- 
inition it solves the problem and thus becomes the last response 
in the problem situation. If it fails to remove the maintaining stim- 
uli, then no matter what other changes it may produce it cannot 
be the last response in the problem situation. By this interpreta- 
tion, we can see why food might be expected in Guthrie’s theory 
to produce more learning than removal from the box. 

However, this interpretation also raises other problems. In 
some cases responses are learned that do not remove the main- 
taining stimuli. A softball player learns those responses that 
produce successful batting and fielding, even though a base hit or 
a good catch does not reduce the competitive excitement that pro- 
vides the maintaining stimuli for playing. A rat in a maze learns 
the correct turn at each choice point, even though only the last
turn is followed by any change in the maintaining stimuli. In both
cases, however, the response does change the over-all stimulus 
combination, so that it is in one sense the last response in the sit- 
uation. Thus we are faced with the problem that sometimes only 
changes in the maintaining stimuli are crucial, while at other times 
changes in other stimuli are important. How do we know which 
times are which? Guthrie does not tell us. By sometimes consider- 
ing all stimuli and sometimes only maintaining stimuli, Guthrie can 
explain any case of learning after it has occurred, but he cannot 
do so well at predicting what learning will take place. 

By now the reader can see more clearly why Guthrie’s system
is difficult to treat as a formal logical theory. When are we talking 
about movements and when about acts? When should we look at 
all stimulus changes and when only at changes in the maintaining
stimuli? Because he concentrates on presenting simple, entertaining
interpretations of learning, Guthrie never gives clear answers
to these awkward questions. As a
preciate Guthrie’s contribution by looking at his discussions of
some practical learning situations. 


The Breaking of Habits 


Perhaps the best known of these applications is in Guthrie’s 
three methods for changing a bad habit. All three methods de- 
pend on finding out what stimuli evoke the undesirable response 
and then finding a way of making some other response occur in 
the presence of those stimuli. This other response should then oc- 
cur again the next time the stimuli are presented. The emphasis is 
on the exact stimulus and the exact response that are connected. 
Guthrie gives the example of a ten-year-old girl who, whenever 
she came in the door of her house, threw her hat and coat on the 
floor. Time and again her mother scolded her and made her go 
back and hang them up, but to no avail. Finally the mother re- 
alized that the stimulus for the girl to hang up her wraps was the 
mother’s nagging. The next time the girl threw down her wraps, 
her mother insisted that she put them on again, go outside, come in 
the door again, and hang up her coat and hat at once. After a few 
trials of this procedure the girl learned to hang up her wraps. The 
desired response had been attached to the stimuli of coming in the 
door, and the habit of throwing down the wraps had thus been 
replaced by the habit of hanging them up. This procedure worked 
where the previous nagging had failed because this time the 
mother saw to it that the girl hung up her wraps in the presence 
of the particular stimuli (those resulting from having just come 
through the door) that had previously led to the response of throw- 
ing the wraps down. 

The first method, which may be called the threshold method, 
involves presenting the stimuli so faintly or weakly that the unde- 
sirable response does not occur. (The stimulus is then said to be 
below the threshold intensity for the response, hence the name.) 
The stimuli are then increased in strength so gradually on succes- 
sive occasions that the response never occurs. These repeated ex- 
periences with the weak stimuli raise the threshold so that stronger 
stimuli will also be below threshold. Eventually the stimuli can 
be presented at full strength without eliciting the undesired re- 
sponse, since the individual has been making some other response 
repeatedly in the presence of the stimuli. This method is useful
mainly for emotional responses, involving anger, fear, or the like. 
Guthrie gives the example of the old cavalry method of training 
saddle horses. If an untrained horse is saddled and ridden, it will 
buck wildly. This reaction can be avoided by employing the 
method of thresholds to replace the bucking response with a re- 
sponse of standing quietly. First a blanket is put on the horse’s 
back. This pressure on its back is the sort of stimulus that induces 
bucking, but the blanket alone is too weak a stimulus to have this 
effect. After some experience with the blanket the horse is saddled.
Prior to experience with the blanket, the saddle would have
produced bucking, but now it doesn’t. Eventually, after the horse’s
experience with the saddle, a rider can mount without producing 
bucking, though the horse would certainly have bucked if 
mounted before the training with blanket and saddle. The ex- 
perience of not bucking while successively heavier weights were 
placed on its back has eventually resulted in the horse's standing 
still even for the weight of a rider. 
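
The logic of the threshold method can be sketched as a simple loop. The version below is a rough illustration under assumed numbers: the function name, the intensity units, and especially the fixed amount (`gain`) by which each quiet exposure raises the threshold are hypothetical, since Guthrie gives no quantitative rule.

```python
def threshold_training(start_threshold, full_strength, step, gain):
    """Raise the stimulus in small steps that stay below the current threshold.

    Returns the list of intensities presented. The fixed 'gain' added to the
    threshold after each quiet exposure is an assumption made for illustration.
    """
    if step <= 0:
        raise ValueError("step must be positive")
    threshold = start_threshold
    intensity = 0.0
    presented = []
    while intensity < full_strength:
        # Step up, but never beyond full strength.
        intensity = min(intensity + step, full_strength)
        if intensity >= threshold:
            raise ValueError("step too large: the undesired response would occur")
        presented.append(intensity)
        threshold += gain  # the quiet exposure raises the threshold
    return presented


# Small steps reach full strength without ever crossing the threshold.
print(threshold_training(start_threshold=2, full_strength=10, step=1, gain=1))
```

With the same starting threshold, a step of 3 raises the error on the first presentation. That case corresponds to saddling and mounting the untrained horse at once.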

The second method may be called the method of fatigue. The 
response to be eliminated is elicited again and again, until the in- 
dividual is so tired that he stops making the response and does 
something else (if only resting) instead. This other response is then 
the one likely to occur when the stimuli are presented again. 
quently at the end of this experience than at the beginning, but
the last thing she did was to push the match box away. Pushing 
it away was therefore what she tended to do when she next en- 
countered a match box. 

In the third method, which we may call the method of incom- 
patible stimuli, the stimuli for the undesired response are pre- 
sented along with other stimuli that can be counted on to pro- 
duce a different, incompatible response. The original stimuli then 
become attached to the new responses. Guthrie illustrates 
this method with the case of a woman college student who could 
not study because of the distracting noise. She solved this prob- 
lem by spending a period of time reading absorbing mystery nov- 
els instead of studying. These stories held her attention so well 
that she ignored the distracting noises. The stimuli of noise thus 
occurred along with the responses of reading and became at- 
tached to these responses. When she then changed back (however 
reluctantly) from reading mysteries to reading textbooks, she 
found that the noises no longer distracted her, for they were now 
attached to reading responses instead of listening responses. 

It is noteworthy that there is no reference to punishment in any 
of these methods. Someone might suggest that it was punishing 
for the one little girl to go out of the house, come in again, 
and hang up her clothes, and also punishing for the other girl to 
light so many matches at once. True, says Guthrie, but remem- 
ber that other forms of punishment had previously failed. The im- 
portant question is not whether these experiences were punish- 
ing but what they led the individuals to do. Inflicting pain on 
someone, says Guthrie, cannot be expected to change his habits 
if the pain does not occur in the presence of the stimuli that pro- 
duce the behavior. Scolding the one girl after her clothes were al- 
ready on the floor, or slapping the other girl after she had fin- 
ished with the matches, would be irrelevant to the habits in ques- 
tion. Only when the punishment resulted in a new response to the 
same stimuli was it effective. Moreover, as the threshold method 
indicates, absence of punishment may be just as good a way of 
producing new responses as is punishment. 

This attitude is characteristic of Guthrie’s interpretation of pun- 
ishment. Always look at what punishment makes the individual 
do. If punishment succeeds in changing the punished habit, it is
because it elicits behavior incompatible with it. If punishment 
fails, it is because the behavior elicited by the punishment is not 
incompatible with the punished behavior. Thus, if you want to 
stop a dog from chasing cars, slapping his nose as he runs 
is likely to work, whereas slapping his rear is not. The two blows
may be equally painful, but the one on the nose tends to make him
stop and jump backward, whereas the one on the rear tends to
make him continue forward all the more vigorously. Hence the
blow on the nose, by eliciting behavior incompatible with running 
after the car, makes this nonrunning more likely to occur next 
time. The blow on the rear, however, has no such effect; it may 
even strengthen the chasing. Punishment works, when it does 
work, not because it hurts the individual but because it changes 
the way he responds to certain stimuli. 

If one wants to use these methods of Guthrie’s as cures for
bad habits, a problem at once arises. In order to obtain different
behavior from the individual, one has to change the stimuli in 
some way. For the little girl, coming in the door and taking off 
dealing with them. A theorist who can give us this much need not
be ashamed of his practical contributions. 


Some Special Topics 


Guthrie’s emphasis on responses to stimuli and the ways of 
changing them shows up in other contexts, too. What of extinc- 
tion? Since Guthrie does not talk about reinforcement, he cannot 
talk about extinction as resulting from the removal of reinforce- 
ment. Instead, he says that extinction is simply learning to do 
something else. The response was learned because it changed the 
stimulus situation into a different one, thus becoming the last thing 
that was done in the original situation. If, now, the learned re- 
sponse no longer produces this change, the individual will go on 
doing various things in the situation until some other response does 
change the situation. That new response will then be the one 
that tends to occur next time. If this new response consistently 
terminates the situation, it will replace the old response. If, how- 
ever, there is no longer any consistency in what response will 
terminate the situation, behavior will be variable from time to 
time. No particular new response will be learned, but the old re- 
sponse will still be replaced by new responses, in this case, many 
new ones. If a dog has learned to get out of his yard by crawling
through a hole in the fence, and if this hole is now mended, he may 
or may not discover some other mode of escape. In any case, how- 
ever, the response of going to the place where the hole was will 
be replaced by some other response. 

Guthrie’s interpretation of forgetting is similar. Habits do not 
weaken with disuse; they are replaced by other habits. If we for- 
get the German vocabulary we have learned, it is because the Eng- 
lish words as stimuli have become attached to other responses than 
the German words. If we lose our skill at horseback riding, it 
is because we have practiced other, competing responses in sit- 
uations that were somewhat similar to being on horseback (such 
as being on a bicycle). In most cases the details of the relearn- 
ing process are obscure, and it would be difficult to predict with 
much accuracy how much of some knowledge or skill one would 
forget under a given set of circumstances. However, this interpreta- 
tion of forgetting does provide a good starting point for studying 
the factors that influence forgetting. Even this start is more than
many theorists have provided for this topic. 

Forgetting, like acquisition, is usually gradual because of the 
many specific stimulus-response connections that make up a 
complex habit. If the correct responses have been attached to 
many different stimuli, it will take longer for new responses to get 
attached to all of these stimuli. Hence it is possible to make one 
definite prediction from Guthrie’s interpretation of forgetting. 
We can predict that a habit will be better retained if it has been 
practiced in a number of different situations (i.e., in the pres- 
ence of a number of different stimulus combinations). In the 
course of forgetting, new responses may replace the old, correct 
ones quickly in any one of these situations, but the old response 
will still be conditioned to many other stimulus combinations. 
On another occasion when the stimulus combination is different, 
the old response is likely to reappear. Forgetting, like learning, is
specific to the situation, and what is forgotten in one situation may
well be remembered in another. The “forgotten” response to the
changed stimulus will occur only, however, if the response in question
was originally learned to a variety of different stimulus combinations.
Hence we can increase the resistance of a habit to forgetting
(including those verbal habits that we call knowledge)
not only by practicing it more but by practicing it in a variety of
situations.
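
This prediction can be stated very compactly. In the sketch below, a habit is represented as the set of stimulus combinations to which the old response is attached, and forgetting as the loss of some of them to competing responses. The function name and the example situations are hypothetical, and treating stimulus combinations as simple labels is a simplifying assumption.

```python
def retained_contexts(practiced_in, relearned_in):
    """Contexts where the old response is still attached after interference.

    practiced_in: stimulus combinations in which the habit was practiced.
    relearned_in: combinations that later became attached to a competing response.
    """
    return set(practiced_in) - set(relearned_in)


# The same interference (a new response learned in the classroom) leaves
# nothing of the narrowly practiced habit but most of the broadly practiced one.
narrow = retained_contexts({"classroom"}, {"classroom"})
broad = retained_contexts({"classroom", "home", "library"}, {"classroom"})
print(len(narrow), len(broad))   # 0 2
```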

The impression one gets from most of Guthrie’s writing is that 
human behavior is a very 
controlled by stimuli, 
for the consequences of this action. Consider a person in a burning
building who has the intention of jumping to safety. In this case 
the four components are: (1) heat from the fire, choking sensa- 
tions from the smoke, and fear, (2) the fire and the height 
that block him from simply running away, (3) the bracing of his 
muscles for the jump, and (4) the preparation of his body for the 
shock of the fall. These four components, says Guthrie, are all that 
we need to describe an intention. 

If someone suggested to Guthrie that the intention is something 
mental, over and above these four physical components, he 
might have replied with his story about a strange murder case. A 
man resolved to shoot his neighbor, and hid outside the neigh- 
bor’s house with a rifle pointed at the door and his finger on the 
trigger. While sitting there, he began to think better of his plan. He 
was about to get up and go away when the neighbor came out the 
door. He pulled the trigger and the neighbor fell. At his trial, the 
question arose as to whether or not he fired intentionally. According
to Guthrie, this is a meaningless question. That part of his intention
which took the form of words to himself had changed, but
the part that involved a ready trigger finger had not changed. There 
was no one intention, only a variety of bodily adjustments which 
did or did not prepare him to shoot. 

What is attention? A variety of responses that orient the sense 
receptors toward certain stimuli, as in looking or listening. There
may even be scanning, involving searching movements that end 
when a certain stimulus is perceived. This formulation of attention 
makes it possible for Guthrie to reword his basic principle of 
learning in the form: “What is being noticed becomes a signal for 
what is being done.” (Guthrie, 1959)

In all of these interpretations, Guthrie is insisting that the proc- 
esses involved, though they may be called by subjective terms, 
refer to objective physical movements. They may be hard to observe,
but they are there just as surely as any other movements.
Guthrie particularly emphasizes the role of movement-produced 
stimuli, the sensations produced by our own movements. These 
play an important part in thought, purpose, the coordination of sequences
of behavior, and responses to stimuli that are no longer
present. Though they often function like intervening variables in
his system (since they are often impractical to observe directly), he
hesitates to call them intervening variables since for him they are 
just as objectively present as the independent and the dependent 
variables. 

Guthrie’s final theoretical statement (Guthrie, 1959), written 
shortly before his death, is both more technical and more 
tentative than most of his earlier work. He seems to be trying to 
clarify both his own ideas and his relationships to other theorists.
In this statement he is concerned both with the concept of
attention and with the formal structure of his theory. However, 
it is likely that Guthrie will be remembered not so much for his at- 
tempts, successful or not, at formal theory building as for his in- 
formal contributions to our thinking about the learning process. 

In trying to understand or to control any learning situation, 
Guthrie reminds us to look at the particular response that is being 
made and the particular stimuli that are eliciting it. He warns us 
not to rely on vague exhortations, not to look for magic in the administration
of rewards and punishments, but to concentrate on
eliciting particular patterns of behavior in particular situations.
Though he tends to draw his examples largely from child rearing 
and animal training, Guthrie has much that is useful to offer adult 
human learners as well. College students often complain that 
they know the material they have studied but that they somehow 
cannot present it on examinations. A familiarity with Guthrie’s
thinking would lead one to say, “If the behavior you want to produce
is the behavior of writing essays about certain topics, practice
writing essays, and practice in a situation as close as possible
to that of an exam.” A simi
Guthrie’s approach can stand one in good stead in many learning
situations. 


SUMMARY OF CONTIGUITY THEORY 


Watson and Guthrie of course have in common all the char- 
acteristics of behaviorism in the general sense of that term. In ad- 
dition, there is one respect in which the two of them differ from the 
other behavioristic theorists we will be discussing, namely, in that 
they make no use of the concept of reinforcement. Watson ridi- 
culed the idea that reward could determine what was learned, re- 
garding it as a magical notion unfit for a scientific explanation. 
(Pavlov spoke of the unconditioned stimulus as a reinforcer for 
the conditioned response, but Watson ignored this aspect of condi- 
tioning.) Guthrie similarly avoided making any reference to the 
reinforcing effects of rewards. In their systems, learning is as- 
sumed to depend only on the contiguity of stimulus and response, 
in other words, on the fact that they occur together. Hence Watson 
and Guthrie are called contiguity theorists.

In taking this position, Watson and Guthrie stand in contrast 
to another group of behavioristic theorists, known as reinforcement
theorists. This latter group are just as dedicated to objectivity
and just as much attached to the stimulus-response language
in describing learning. They do not, however, see any objection to
recognizing the reinforcing effect of reward in their theories; in
fact they consider this effect essential to the analysis of learning. It
is to this group of connectionist theorists that we turn in the next
chapter.


REINFORCEMENT THEORIES 
IN THE CONNECTIONIST 
TRADITION 


THE IDEA THAT pleasure and pain as consequences of our acts are
important determiners of behavior has a distinguished history in 
psychology. It forms the basis of the theory of psychological 
hedonism that was developed by Jeremy Bentham and adopted 
by a number of other British philosophers. According to this view, 
we all do those things that give us pleasure and avoid those that 
give us pain. However, it remained for Edward L. Thorndike 
(1874-1949) to make a similar view central to the psychology of
learning.


THORNDIKE’S EARLY CONNECTIONISM 


Thorndike was a pioneer in experimental animal psychology. 
Instead of relying on stories about the intelligent feats of this or 
that animal, he took animals into the laboratory, presented them 
with standardized problems, and made careful observations of 
how they solved the problems. His monograph, Animal Intel- 
ligence, published in 1898, is one of the most renowned classics 
this improvement was very gradual. Even after several experi-
ences of opening the door by pulling the string, animals on a given 
trial would still spend considerable time in other behavior before 
pulling the string. This led Thorndike to conclude that the cat’s 
learning to pull the string involved not an “intelligent” under- 
standing of a relation between string pulling and door opening 
but a gradual “stamping in” of the stimulus-response connection 
between seeing the string and pulling it. 

At the time Thorndike published these studies, they were radical
in two respects: their careful observation of animal behavior
under controlled conditions and their concern with the gradual
strengthening of stimulus-response bonds. They were Thorndike’s
answer to the argument about whether animals solve problems
by reasoning or by instinct. By neither, said Thorndike, but
rather by the gradual learning of the correct response.
However, another point is
by behaviorists for this way of talking about learning. Actually,
however, he defined these terms in a quite objective way: “By a 
satisfying state of affairs is meant one which the animal does noth- 
ing to avoid, often doing things which maintain or renew it. By 
an annoying state of affairs is meant one which the animal does 
nothing to preserve, often doing things which put an end to it.” 
(Thorndike, 1913, p. 2) Thorndike says nothing here about the 
animal’s feelings, only about what the animal does. Thus he ad- 
heres to the concern of behaviorism with what individuals do. His 
language may sound subjective, but his meaning is as objective 
as Watson’s. Working at the height of the behaviorist movement, 
Thorndike had his disagreements with its extreme supporters, 
but actually he and they were close together in interests and ob- 
jectives. In the broader sense of the term, Thorndike was certainly 
himself a behaviorist. 

Later in his career, Thorndike modified the law of effect to make 
satisfiers much more important than annoyers. Reward, he de- 
cided, strengthens connections, but punishment does not directly 
weaken them. If punishment is effective at all in weakening the 
tendency to do something, it is because it produces variable be- 
havior and thus gives some new response a chance to be 
rewarded. This position, except for the emphasis on reward, 
sounds much like Guthrie’s. With this modification, the law of ef- 
fect became simply the now familiar statement (but not at all fa- 
miliar when Thorndike presented it) that satisfying consequences 
serve to reinforce stimulus-response bonds. 

Thorndike was a man of practical interests, and he took a spe- 
cial interest in the psychology of education. For many years 
he served on the faculty of Teachers College, Columbia Univer- 
sity. Throughout his professional life, his studies on the “pure” 
psychology of learning with both human and animal subjects were 
interspersed with studies on the applied psychology of education. 
His emphasis on specificity in learning and on the mechanical 
stamping in of stimulus-response connections has been both 

raised and condemned by educators over the years. (It is to him, 
indeed, that we owe the term “connectionism.”) For our purposes, 
however, these aspects of Thorndike’s work are too similar to 
Watson’s and Guthrie’s to require further comment here. Thorndike
was no less a pioneer of objective psychology than Watson;
indeed his original contributions were quite likely more important 
than Watson’s. However, our concern here is that he incorporated 
within his objective psychology of learning the law of effect, 
and thus became the first real reinforcement theorist. 


SKINNER’S INTERPRETATION OF LEARNING 


Thorndike was not a particularly systematic theorist. He stated 
many principles of learning, but he did so in a rather casual way. 
His voluminous writings show little attempt at consistency in the 
details of formulation. For our purposes it will be more profit- 
able, rather than studying Thorndike in detail, to look at a quite 
similar contemporary viewpoint, that of B. F. Skinner (b. 1904). 
Of a later generation than Thorndike, Skinner received his doctor- 
ate of philosophy in psychology from Harvard in 1931 and after 
teaching at the Universities of Minnesota and Indiana, returned to 
Harvard as a professor in 1948. His relation to Thorndike is 
somewhat like that of Guthrie to Watson: not by any means a dis- 
are born with a number of reflexes, and we acquire others through
the process of conditioning. The knee-jerk reflex is an example. 
The learning of respondent behavior follows the pattern that we 
have referred to earlier as classical conditioning. A new stimu- 
lus is paired with the one that already elicits the response, and 
after a number of such pairings the new stimulus comes to 
elicit the response. The presentation of the old (unconditioned) 
stimulus during training may be considered the reinforcer, since 
without it learning will not occur. Thus the learning of 
respondent behavior in Skinner’s system is the same kind of 
learning that Watson assumed made up all learning. The only 
difference is that Skinner emphasizes the reinforcing role of the 
unconditioned stimulus. 

Skinner maintains, however, that most behavior is of a different 
sort. This kind he refers to as operant behavior. Whereas the dis- 
tinctive characteristic of respondent behavior is that it is in re- 
sponse to stimuli, the characteristic of operant behavior is that 
it operates on the environment. There is no particular stimulus that 
will consistently elicit an operant response. Skinner speaks of op- 
erant behavior as being emitted by the organism rather than 
elicited by stimuli. Most behavior is of this sort; walking, talking, 
working, and playing are all made up of operant responses. 

Skinner does not mean to say that operant behavior is not in- 
fluenced by stimuli. Much of his analysis of behavior is concerned 
with ways in which operant behavior is brought under the con- 
trol of stimuli. However, such control is only partial and condi- 
tional. The operant response of reaching for food is not simply 
elicited by the sight of food; it also depends on hunger, social circumstances, 
and a variety of other stimulus conditions. In these respects 
it is in contrast to the respondent knee-jerk response, which 
is regularly elicited by a tap on the knee almost without regard 
to other conditions. Because of this distinction, Skinner does not 
consider it useful to think of operant behavior as made up of spe- 
cific stimulus-response connections in the sense that respondent 
behavior is. Whereas Guthrie analyzes every bit of behavior in 
terms of the stimuli that produce it, Skinner prefers to think of 
most behavior (the operant kind) as emitted by the organism, without 
bothering to consider the multitude of stimuli that have something 
to do with its occurrence. The difference between Guthrie 
and Skinner on this point is one of emphasis and convenience rath- 
er than of direct disagreement. Both agree that behavior depends 
on the total pattern of stimuli, external and internal, that are pres- 
ent when it occurs, but Guthrie prefers to emphasize this point for 
all responses, whereas Skinner prefers to ignore it in those cases 
where no one particular stimulus is crucial to the occurrence of 
the response. 

The learning of operant behavior is also known as conditioning, 
but it is different from the conditioning of reflexes. Operant con- 
ditioning is the same sort of learning that Thorndike described. Be- 
cause for Skinner this is by far the more important kind of learn- 
ing, this topic is one on which he and Thorndike are close to- 
gether. If an operant response (often called simply an operant) oc- 
curs and is followed by reinforcement, its probability of occurring 
again increases. Whereas for reflexes the reinforcer is an uncon- 
ditioned stimulus, for operants it is a reward (or, as Thorndike 
would say, a satisfier). Thus we may say that reward following 
an operant makes that response more likely to occur again. (Even 
though the stimulus for an operant is unknown, Skinner still often 
refers to the operant behavior as a response.) This is the pattern 
of operant learning, which is to say, of most of the learning 
discussed by Skinner. 


Positive and Negative Reinforcers 


Although Skinner is largely concerned with positive reinforc- 
ers, he also recognizes the existence of negative reinforcers. Neg- 
ative reinforcers are aversive stimuli, ones that the individual com- 
monly seeks to avoid. Whereas reinforcement results from the 
occurrence of a positive reinforcer, it results from the termination 
of a negative reinforcer. Electric shock, for example, is a negative 
reinforcer because the termination of the shock is reinforcing. 
Thus a response can be reinforced either by presenting a positive 
reinforcer or by removing a negative reinforcer. 

An important point about reinforcers, both positive and negative, 
is that they can be conditioned. If a stimulus occurs repeatedly 
with a positive reinforcer, it tends itself to acquire the capacity 
to reinforce behavior. It is then called a conditioned positive 
reinforcer. A sign reading “Restaurant” will serve as a conditioned 
positive reinforcer for a hungry man in a strange city, because 
such signs have been associated with food in the past. Similarly, 
a stimulus that occurs with a negative reinforcer tends to become 
a conditioned negative reinforcer, as in the familiar case of 
the burnt child who learns to avoid the stove even when it is cold. 

The topic of negative reinforcement is obviously related to 
punishment, but the exact relation is not obvious. Negative rein- 
forcement results from the removal of a negative reinforcer, 
whereas punishment involves the presentation of a negative rein- 
forcer. What effects does punishment have, and how are these ef- 
fects produced? Skinner points out that punishment is not a very 
reliable way of preventing responses from occurring. Reinforce- 
ment increases the probability of a response, but punishment does 
not necessarily reduce the probability. When it does, the reduc- 
tion may result from any of three reasons. 

First, the aversive stimulus used as punishment is likely to 
have emotional effects. These emotional effects are respondents 
that are elicited by the aversive stimulus. These respondents are 
likely to be incompatible with the punished response, so that they 
reduce its probability of occurrence. For example, if scolding 
a child for eating forbidden candy makes him cry, it is likely also to 
stop his eating, since it is difficult to eat and cry at the same time. 
This effect, however, is temporary. When the aversive stimulus 
is removed the emotional effects soon dissipate. The punished 
behavior then occurs again, often at an even higher rate than be- 
fore punishment. So, although this effect of punishment is often 
useful in stopping undesirable behavior, it is not useful in keeping 
it from happening again. 

The second effect of punishment is an extension of the first. 
When a neutral stimulus is paired with an aversive stimulus that 
elicits a respondent, this is a conditioning situation. Whatever 
stimuli are present when punishment occurs thus have a chance 
to become conditioned stimuli for the emotional responses to pun- 
ishment. These conditioned stimuli would then produce emotional 
respondents in the absence of the original punishment. As a re- 
sult, the previously punished response would tend to be replaced 
by these competing responses, much as if punishment were still 
being presented. In our previous example, the child would tend 
to be frightened (make emotional responses) as soon as he 
touched the forbidden candy that had previously been paired 
with punishment. This fear would inhibit his eating responses. 
This effect of punishment, though basically similar to the first, 
is thus more lasting in its effects. 

The third effect of punishment is an application of negative re- 
inforcement based on the conditioned aversive stimuli from the 
second effect. When the individual turns away from the condi- 
tioned aversive stimuli, this act removes these stimuli and provides 
negative reinforcement. He is thus reinforced for making a re- 
sponse (turning away) that is incompatible with the punished re- 
sponse. As a result, he learns to turn away instead of making the 
punished response. Once stimuli from the forbidden candy be- 
came aversive to the child, he was reinforced for anything that 
got him away from these stimuli. Since getting away from these 
stimuli is incompatible with approaching them, he was learning 
not to eat the forbidden candy. This process, somewhat similar 
to Guthrie’s interpretation of punishment, is what disciplinarians 
usually hope to accomplish by punishment. However, this effect, 
like the second one, lasts only until the conditioned aversiveness 
of the stimuli extinguishes. To maintain this effect, the disciplinar- 
ian must be prepared to give further punishments as needed to 
maintain the new behavior. 

In general, Skinner regards punishment as a poor method of 
controlling behavior. For one thing, it is deceptive, since the first 
of its three effects often makes it appear dramatically successful 
when in fact it has produced only a temporary effect. For another 
thing, the emotional behavior it produces is likely to be undesirable 
from other points of view. Replacing misbehavior with crying 
or anger is seldom a good solution. Finally, the emotional responses 
may become conditioned to stimuli other than the ones 
the punisher wishes, including the stimuli of the punisher himself. 
Thus punishment is both a rather unreliable technique for 
controlling behavior and a technique that is likely to have unfortunate 
side effects. 


The Role of Stimuli 


Although stimuli do not elicit operants in the sense that they 
elicit respondents, they may determine whether or not any given 
operant will occur. A stimulus acquires this influence through the 
process of discrimination. If an operant is reinforced in the pres- 
ence of one stimulus but not reinforced when it occurs in the pres- 
ence of a different stimulus, the tendency to respond when the 
second stimulus is present gradually becomes extinguished, and 
a discrimination is formed. The operant will then occur in the presence 
of the first stimulus but not of the second. Skinner refers to 
the first stimulus as an S^D and to the second as an S^Δ (S delta). 
With regard to any operant, an S^D is a stimulus in the presence 
of which the individual has learned to make the response and an 
S^Δ is a stimulus in the presence of which he has learned not to respond. 
However, the operant is still not elicited by the S^D as a respondent 
is elicited, since occurrence of the operant depends on 
factors other than the S^D. For example, a pigeon can be taught 
to peck a key when it is red (S^D) but not when it is green (S^Δ) by 
reinforcing with food the pecks of the red key but not those of 
the green key. However, the pigeon will show little tendency to 
peck the red key when completely satiated for food. The S^D is a 
major determinant of the operant pecking response, but it does not 
produce it in the automatic way that a stimulus elicits a reflex. Un- 
der these conditions the operant is said to be under stimulus con- 
trol. 
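
The pigeon example can be put in a small numerical sketch. The model
below is an assumption made only for illustration: response strength
is a number between 0 and 1, it moves a fixed fraction of the way
toward 1 when pecks are reinforced and toward 0 when they are not,
and the names train_discrimination, peck_probability, and hunger are
hypothetical. Skinner himself proposes no such equations.

```python
def train_discrimination(sessions=50, rate=0.2):
    """Toy model: strength of pecking in the presence of each key color."""
    strength = {"red": 0.5, "green": 0.5}
    for _ in range(sessions):
        # Pecks at the red key are reinforced with food: strength rises toward 1.
        strength["red"] += rate * (1.0 - strength["red"])
        # Pecks at the green key are never reinforced: strength falls toward 0.
        strength["green"] += rate * (0.0 - strength["green"])
    return strength


def peck_probability(strength, color, hunger):
    """The stimulus sets the occasion, but deprivation (0 = satiated,
    1 = fully deprived) also governs whether the operant is emitted."""
    return strength[color] * hunger


trained = train_discrimination()
print(peck_probability(trained, "red", hunger=1.0))    # hungry, red key
print(peck_probability(trained, "green", hunger=1.0))  # hungry, green key
print(peck_probability(trained, "red", hunger=0.0))    # satiated, red key
```

In this sketch the red key comes to control pecking, yet a satiated
bird still does not peck, which is the sense in which the stimulus
sets the occasion for the operant without eliciting it.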

Skinner is militantly opposed to those aspects of theory that 
involve assumptions about entities or processes that cannot be ob- 
served. He has sometimes denied, in fact, that his systematic treat- 
ment of behavior is a theory at all. Although Watson and Guthrie 
proudly claim to be talking only about what is physically real, 
Skinner points out that we cannot actually observe a habit or a 
movement-produced stimulus. Hence, for Skinner, such concepts 
are inadmissible. Here he even diverges from Thorndike, since a 
stimulus-response bond is something invisible assumed to exist 
inside the body. Skinner’s insistence on talking only about behavior 
and its external determinants, not about what may be going on 
inside the body, has led to the statement that Skinner studies an 
“empty organism”; not empty in fact, of course, but empty so 
far as Skinner’s interpretations are concerned. 

Instead of talking about stimulus-response connections (except 
in the case of respondents), Skinner speaks of the rate at which a 
given operant is emitted under a given set of conditions. This lan- 
guage puts primary emphasis on the response, with the stimuli be- 
ing of importance only as they set conditions for the response to 
occur. Moreover, stimuli constitute only one of the factors that in- 
fluence the emission of operants; such factors as food depriva- 
tion (which Skinner does not regard as a stimulus) must also be 
considered. This language raises some question as to whether it 
is proper to call Skinner a connectionist, since he does not actually 
speak of connections being formed between stimuli and responses. 
However, his concern with specific responses, stimulus control, and 
reinforcement makes it more appropriate to discuss him as a con- 
nectionist reinforcement theorist than under any other heading. 


The Scheduling of Reinforcers 


Skinner’s antitheoretical bias makes it difficult to discuss his 
system apart from his research and its applications (a fact of 
which Skinner would be proud). His research has been conducted 
almost entirely in one version or another of an apparatus that has 
become known as the Skinner box. This varies in size and form 
according to the organism being studied, but basically it is simply 
a box (or, from the subject’s point of view, a room) containing 
a simple manipulandum (i.e., something the subject can manipulate) 
and a device for delivering reinforcers. The manipulandum 
may be a lever for rats to press, a key (i.e., a disc resembling a 
telegraph key) for pigeons to peck, a vending-machine plunger 
for humans to pull, or anything else appropriate for the kind of 
subject using it. The mechanism for providing reinforcers is typi- 
cally some sort of feeder, delivering food pellets to rats, grain to 
pigeons, or candy bars to humans. Other sorts of reinforcers may 
be used, however, from drops of water for thirsty rats to peep shows 
for monkeys and humans. In some cases escape from electric shock 
to the feet is used as the reinforcer, in which case no separate 
reinforcement dispenser is required. 

The basic principle on which the box operates is that responses 
to the manipulandum produce reinforcers. These responses are 
called free operants, since the subject is free to emit them at his 
own speed. The rate at which the free operant is emitted is the re- 
sponse measure. Whereas other experimenters may study speed of 
running, number of correct choices, or other aspects of operant be- 
havior, Skinner and his followers study only the rate of emission 
of free operants. Whatever variable they manipulate, they study 
its effects in terms of this measure. However, as we have seen, 
there are many different free operants that can be studied. The 
important thing, says Skinner, is to find an appropriate operant for 
the kind of individual one wants to study, that is, an operant that 
the subject can emit conveniently and fairly rapidly. If this condition 
is met, the particular operant chosen makes little difference to the 
laws that will be found. We use levers for rats and keys for pigeons 
rather than vice versa because these manipulanda suit the re- 
sponse capacities of these animals, but we find similar laws for 
rats and for pigeons when each learns an operant appropriate for 
its own capacities. 

The rate at which the free operant is emitted (the dependent var- 
iable) can be related to a great variety of independent variables. 
In practice, however, Skinner and his followers have concentrated 
largely on one independent variable, the schedule of reinforce- 
ment. This term refers to the particular pattern according to 
which reinforcers follow responses. The simplest schedule is con- 
tinuous reinforcement, in which a reinforcer is given for every re- 
sponse to the manipulandum. This schedule is generally used 
when the subject is first being trained to use the manipulandum. 
After the response is learned, the schedule is usually shifted to some 
form of intermittent reinforcement, in which only part of the re- 
sponses are followed by reinforcement. Skinner has collaborated 
with Charles Ferster on a large book, Schedules of Reinforce- 
ment, describing different schedules of intermittent reinforcement 
and their effects, but fortunately for the student they tend to rep- 
resent variations on two basic patterns. If the frequency with 
which reinforcers are presented depends on the rate at which re- 
sponses are emitted, this is called a ratio schedule; if it depends 
simply on the passage of time, it is called an interval schedule. In 
addition, each of these two kinds of schedule may be either fixed 
or variable. Combining these two bases of classification gives four 
main kinds of schedule. 

In a fixed-ratio schedule, the subject is reinforced after every 
so many responses. Thus a reinforcer may be delivered after every 
fourth or every tenth or every twentieth response. A variable-ratio 
schedule differs from a fixed-ratio schedule in that the reinforcer, 
instead of being presented consistently after every so many re- 
sponses, is presented after a different number of responses on dif- 
ferent occasions. In this case the ratio is the average number of 
responses per reinforcer. Thus on a variable-ratio five schedule, 
reinforcers are delivered on the average after every five responses, 
but on one occasion two successive responses might be reinforced, 
while on another occasion the individual might have to make as 
many as ten responses after getting one reinforcer before getting 
another. 

On a fixed-interval schedule, a fixed interval of time has 
to elapse after one reinforcer is delivered before another can be 
obtained. Once this interval has elapsed, the first response will be 
reinforced. Thus on a fixed-interval one-minute schedule, the 
subject cannot obtain reinforcers oftener than one a minute re- 
gardless of how fast he responds. He can obtain one each minute 
equally well by responding rapidly all the time or by responding 
only once a minute. If he waits for a while after the minute has 
elapsed before making the response, the reinforcement will be 
correspondingly delayed. A variable-interval schedule makes it 
possible to obtain a reinforcer sometimes sooner and sometimes 
later after the previous one. Thus on a variable-interval two-minute 
schedule, the average time after presentation of one reinforcer 
when another would become available would be two minutes, 
but on any particular occasion the interval might be considerably 
shorter or longer. Hence the only way to be sure of getting 
all available reinforcers as soon as possible would be to respond 
continuously. 
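
The four kinds of schedule can be stated compactly as rules for
deciding whether a given response is reinforced. The Python sketch
below is offered only as an illustration. The class names, the
respond(t) method, and the uniform distributions used for the two
variable schedules are assumptions of ours and do not come from
Ferster and Skinner.

```python
import random


class FixedRatio:
    """Reinforce every n-th response."""

    def __init__(self, n):
        self.n = n
        self.count = 0

    def respond(self, t):
        # Time t is ignored: only the number of responses matters.
        self.count += 1
        if self.count == self.n:
            self.count = 0
            return True
        return False


class VariableRatio:
    """Reinforce after a varying number of responses that averages mean_n."""

    def __init__(self, mean_n, rng):
        self.mean_n = mean_n
        self.rng = rng
        self.count = 0
        self.required = self._draw()

    def _draw(self):
        # Assumed distribution: uniform on 1 .. 2*mean_n - 1, whose mean is mean_n.
        return self.rng.randint(1, 2 * self.mean_n - 1)

    def respond(self, t):
        self.count += 1
        if self.count == self.required:
            self.count = 0
            self.required = self._draw()
            return True
        return False


class FixedInterval:
    """Reinforce the first response made once `interval` has elapsed
    since the previous reinforcer was delivered."""

    def __init__(self, interval):
        self.interval = interval
        self.last = 0.0

    def respond(self, t):
        if t - self.last >= self.interval:
            self.last = t
            return True
        return False


class VariableInterval:
    """Like FixedInterval, but the required wait varies around mean_interval."""

    def __init__(self, mean_interval, rng):
        self.mean_interval = mean_interval
        self.rng = rng
        self.last = 0.0
        self.wait = self._draw()

    def _draw(self):
        # Assumed distribution: uniform on 0 .. 2*mean_interval.
        return self.rng.uniform(0.0, 2.0 * self.mean_interval)

    def respond(self, t):
        if t - self.last >= self.wait:
            self.last = t
            self.wait = self._draw()
            return True
        return False


fr4 = FixedRatio(4)
print([fr4.respond(t) for t in range(8)])
# [False, False, False, True, False, False, False, True]

fi60 = FixedInterval(60)  # fixed-interval one-minute, times in seconds
print([fi60.respond(t) for t in (10, 60, 70, 125)])
# [False, True, False, True]
```

Note that the two ratio classes never look at the clock and the two
interval classes never count responses, which is the whole of the
distinction between them.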

How do these schedules differ in the patterns of responding that 
they produce? First, ratio schedules typically give higher rates of 
responding than interval schedules. This difference is not surprising, 
since fast responding on a ratio schedule increases the number 
of reinforcements in a given period of time, whereas fast responding 
on an interval schedule only serves to obtain each reinforcement 
a little sooner. Second, on both kinds of fixed schedule 
there is a tendency for responding to be slowest immediately after 
reinforcement (more so than can be explained simply by the time 
required to consume the reinforcer). The reason is that responses 
immediately after reinforcement are never reinforced. Intuitively 
this effect is easier to appreciate in the fixed-interval schedule, since 
the individual has nothing to gain from responses during the 
interval. On a fixed-interval schedule, the resulting pattern is a 
gradual increase in rate of responding from just after one reinforce- 
ment to just before the next. Because it looks on a graph like the 
edge of the familiar sea shell, this effect is known as scalloping. The 
fact that there is a gradual increase in rate during the interval rather 
than a sudden burst of responding just when the next reinforcer is 
due reflects the subject’s inability to discriminate time perfectly. 
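
The first of these points, that speed pays on a ratio schedule but
not on an interval schedule, can be checked with simple arithmetic.
The sketch below assumes a perfectly steady responder and a
ten-minute session; the function names and the particular values
(fixed-ratio 20, fixed-interval one minute) are chosen only for
illustration.

```python
def fixed_ratio_reinforcers(gap_s, session_s, ratio):
    """Reinforcers earned on a fixed-ratio schedule by a subject that
    responds once every gap_s seconds for session_s seconds."""
    responses = session_s // gap_s
    return responses // ratio


def fixed_interval_reinforcers(gap_s, session_s, interval_s):
    """The same steady responder on a fixed-interval schedule: the first
    response at least interval_s after the last reinforcer is reinforced."""
    earned = 0
    last_reinforcer = 0
    for t in range(gap_s, session_s + 1, gap_s):
        if t - last_reinforcer >= interval_s:
            earned += 1
            last_reinforcer = t
    return earned


SESSION = 600  # a ten-minute session, in seconds

for gap in (10, 1):  # a slow responder, then one ten times as fast
    fr = fixed_ratio_reinforcers(gap, SESSION, ratio=20)
    fi = fixed_interval_reinforcers(gap, SESSION, interval_s=60)
    print(f"one response every {gap:2d} s: FR 20 earns {fr}, FI 1-min earns {fi}")
```

Under these assumptions a tenfold increase in response rate raises
the fixed-ratio yield from 3 reinforcers to 30, while the
fixed-interval yield stays at 10.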

On a fixed-ratio schedule we might expect the individual to re- 
spond just as rapidly after a reinforcement as at any other time, 
since he has to make a certain number of responses before the next 
reinforcement regardless of when he makes them. This expecta- 
tion, however, reflects a view of the organism as figuring out the 
most profitable strategy and acting accordingly. According to Skin- 
ner, we must simply look at the reinforcement contingencies. On 
a fixed-ratio schedule, the first responses after a reinforcement 
are never reinforced as quickly as the later ones, hence they occur 
at a slower rate. However, this makes a difference only when the 
ratio is high enough so that the subject cannot make all the neces- 
sary responses quickly and get almost immediate reinforcement. 
Consequently, low fixed-ratio schedules show very little drop in 
responding after reinforcement, while high fixed ratios show a 
sometimes lengthy pause after reinforcement. Finally, this scalloping 
does not occur with variable schedules. Since all responses, early 
or late, have a chance of being reinforced on a variable schedule, 
responding is at a constant rate except for the brief period that may 
be required for actually consuming the reinforcer. 

In general, organisms will make more responses per reinforcer 
on any kind of intermittent schedule than on continuous reinforcement. 
If reinforcement is finally terminated altogether, resistance 
to extinction is also greater after intermittent than after continu- 
ous reinforcement. To obtain rapid steady responding and high 
resistance to extinction, the variable-ratio schedule is the most ef- 
fective. It is possible, in fact, to get animals to work for food re- 
inforcement on ratios so high that they are actually operating at a 
biological loss: the energy expended in operating the manipulandum 
is greater than that obtained from the occasional food reward, 
so that an animal can literally work itself to death. 

These schedules differ in their sensitivity to various disrupting 
factors. This fact has been of particular interest in connection with 
the effects of various drugs, and pharmaceutical companies have 
found it useful to test new drugs on animals responding to differ- 
ent schedules of reinforcement. One generalization which emerges 
from many studies is that interval schedules are more easily dis- 
rupted by a variety of drugs than are ratio schedules. Doses that 
will make responding on an interval schedule quite erratic will 
leave a ratio schedule largely unaffected. Apparently the rat’s 
counting mechanism is more stable or less vulnerable than its tim- 
ing mechanism. Though Skinner is pleased that his experimental 
techniques have been found useful for studying drugs, it is charac- 
teristic of his approach that he makes no attempt to draw infer- 
ences about what these drugs are doing to the animal's body. There 
is a lawful relationship between what drug goes into the body and 
what behavior comes out; what goes on between these two is no 
concern of Skinner’s. 

It should be noted that studies of different schedules of reinforcement 
are not concerned with how the response is originally 
learned. Ordinarily the subject is well trained to use the manipulandum 
before any form of intermittent reinforcement is introduced. 
Thereafter a given individual can adapt fairly readily to one 
schedule after another, changing his behavior to suit each new 
schedule. Thus Skinner’s formal research has been largely con- 
cerned with a short-run aspect of learning, the rapid shifts in per- 
formance level to match shifts in reinforcement conditions. (Some 
writers, in fact, consider such shifts too short-run to be called 
learning at all, and refer to them simply as changes in performance.) 
These studies are concerned not with an individual’s learning 
how to respond, but with his learning how rapidly to respond 
under a new set of reward conditions. 


Shaping 

We should not conclude from this emphasis, however, that 
Skinner has been uninterested in the process of learning how to 
perform complex tasks. Much of his less formal work has been 
concerned with this problem, and he has given striking demon- 
strations of training techniques. Though many experimental psy- 
chologists study animal learning, Skinner is almost unique in his 
concern with animal training. The technique by which he trains 
animals to perform complex acts that are outside their normal 
range of behavior is known as shaping. The behavior is shaped 
through a series of successive approximations, each made possi- 
ble by selectively reinforcing certain responses and not others. 
Thus behavior is gradually brought closer and closer to the de- 
sired pattern. 

Suppose you want to train a rat to press a lever to obtain a mar- 
ble, carry the marble to the other side of the cage, drop it in a hole, 
and then run to a third place in the cage to get food. If left to its 
own devices, the rat might never go through this particular se- 
quence of operations, and if it did the process would probably take 
so long that the food reinforcement at the end would have little ef- 
fect on the initial bar-pressing response. If you tried to train the 
rat by simply waiting until it went through the whole sequence 
spontaneously often enough to learn it, in principle you might suc- 
ceed, but the process would almost certainly exceed the span of 
your patience and might even exceed the life span of the rat. Yet 
this sequence is by no means at the limits of a rat’s learning ca- 
pacity; considerably harder ones have been mastered by rats with 
proper training. 

To accomplish this training Skinner would begin by depriving 
the rat of food to the point where it was at about 80% of its normal 
body weight. Then he would train the rat to eat from the 
food magazine. On a number of occasions a click or some other 
signal would be followed by the delivery of food from the magazine 
into a tray. This procedure would be continued until the 
click led the rat to go immediately to the food tray. This part of the 
procedure is called magazine training. Then the shaping could be- 
gin. At first the click might be sounded and the food delivered 
whenever the rat touched the lever. Soon the rat would be spend- 
ing much of the time between food deliveries close to the lever, 
and touching it frequently. Then the procedure could be changed 
so that food was delivered only when the lever was pressed. After 
the rat learned to press it regularly, food would begin to follow 
lever presses only if the rat, immediately after pressing the lever, 
touched the marble. In successive stages of training, the rat would 
get fed only if it picked up the marble, then if it moved it toward 
the hole, then if it put it into the hole, at which point training would 
be complete. Each stage of this shaping process would change the 
distribution of the animal’s total behavior in the box, making re- 
sponses that were originally rare occur more and more frequently. 
These responses, being closer approximations to the final perform- 
ance, could then be reinforced and all other responses nonrein- 
forced. Only after the rat learned to press the lever frequently, for 
example, was it practical to give reinforcement only when lever 
pressing was followed by marble touching. This gradual changing 
of a subject’s typical behavior to bring it closer and closer to what 
is desired makes up the process of shaping. 
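
The logic of successive approximations can be sketched as a loop in
which the criterion for reinforcement is raised whenever the
subject's typical behavior catches up with it. The model below is a
deliberate oversimplification and every detail of it is an
assumption: behavior is reduced to a single number, responses vary
normally around the subject's typical value, and a reinforced
response pulls the typical value halfway toward itself. It is not a
description of Skinner's procedure with the rat and the marble.

```python
import random


def shape(target=10.0, step=1.0, trials=2000, seed=0):
    """Return the trial on which typical behavior reaches the target,
    or None if it never does within the allotted trials."""
    rng = random.Random(seed)
    typical = 0.0      # center of the subject's current behavior
    criterion = step   # first approximation that earns reinforcement
    for trial in range(1, trials + 1):
        response = rng.gauss(typical, 1.0)
        if response >= criterion:
            # Reinforcement shifts typical behavior toward the reinforced response.
            typical += 0.5 * (response - typical)
            # Once typical behavior meets the criterion, demand a closer approximation.
            if typical >= criterion:
                criterion = min(target, criterion + step)
        if typical >= target:
            return trial
    return None


print("with shaping:", shape())                   # criterion raised in small steps
print("waiting for the target:", shape(step=10.0))  # only the final performance is reinforced
```

Setting step equal to the target corresponds to waiting for the
complete performance to occur spontaneously. In this toy model such
a response essentially never occurs, so nothing is ever reinforced
and the function returns None.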

Skinner has given a number of striking demonstrations of the 
shaping of operant behavior. He has trained rats to go through 
sequences of behavior even more complex and less ratlike than 
that described above. He has trained pigeons to play a modified 
version of ping-pong, pecking a ball back and forth across a table. 
With less complex operant behaviors, he has carried out the entire 
shaping process during a class period, with students watching 
the animal’s behavior gradually change under the influence of the 
reinforcement procedure. Such demonstrations of course require 
some degree of skill and experience in finding responses and rein- 
forcers that are appropriate for the organism being trained and in 
deciding how fast to go. Skinner, however, emphasizes the mechan- 
ical aspect of the procedure, the extent to which a variety of be- 
principles, contributing nothing to our scientific understanding of 
the principles themselves. However, they serve to demonstrate the 
power of reinforcement in molding behavior, and this demonstra- 
tion is Skinner’s object. 

Not all of these demonstrations have been with animals. A 
technique for showing the automatic effect of reinforcement on 
human behavior is called verbal conditioning. In the first experiment 
of this sort (Greenspoon, 1955), the subject was instructed simply 
to say words. The experimenter gave the subject no clue in his 
instructions as to what sorts of words were desired. However, 
whenever the subject said a noun in the plural form, the experi- 
menter said “mmhm.” There was an increase during the session in 
the frequency with which subjects said plural nouns. This increase 
occurred in spite of the fact that many subjects were quite unaware, 
so far as could be determined by questioning, either of the fact 
that they were saying more plural nouns or of any relation between 
the experimenter’s behavior and their own. When the “mmhm” was 
discontinued, frequency of plural nouns declined (extinction). 

In another less formal experiment (Verplanck, 1955) subjects 
were simply engaged in conversation without even being told that 
this was an experiment. The experimenter (who in this case might 
better be called the interviewer) expressed interest in and agreement 
with any of the subject’s remarks that were presented as 
expressions of opinion (i.e., “I think ...,” or “It seems to me ...”). 
Other kinds of remarks produced no reaction from the interviewer. 
Expressions of opinion became more and more frequent during 
the conversation. In both of these verbal-conditioning studies, verbal 
behavior was modified by reinforcing a given kind of verbal 
response and no other. Like any technique of operant conditioning, this 
one depends on finding a reinforcer that will be effective for the 
particular kind of individual being studied, but once one is found 
the principle of reinforcement seems to apply as well here as 
elsewhere. 

However, the “automatic” nature of these effects has been subjected 
to a good deal of question. In studies of animals, no problem 
arises: one need only note the systematic ways that behavior is 
influenced by reinforcement, and whether or not one calls the 
effects automatic is pretty much irrelevant. With humans, however, 
one can ask them what they thought about the whole procedure and 
thus try to determine whether they reacted automatically and 
largely unconsciously to the reinforcement or whether they noticed 
what kinds of responses were being reinforced and then decided 
whether or not to work for reinforcers. From Skinner’s point of 
view, the person’s awareness is not a particularly important issue— 
the important question is whether the reinforcement influences the 
behavior, not whether the person can accurately describe what is 
going on or whether he feels that he is making a free choice. However, 
the issue of awareness in verbal conditioning has aroused 
considerable interest among other psychologists. Although the issue 
is far from settled, the weight of current evidence suggests that 
humans who are influenced by verbal reinforcement are usually at 
least partly aware of what is going on and have some sense of 
choosing how they will react to it (see, e.g., Spielberger and 
DeNike, 1966). Just what significance this observation has for the 
notion that reinforcement is an automatic process is hard to decide, 
but at least it appears that the process, though often partly unconscious, 
is rarely completely so. 

An interesting sidelight on the shaping of behavior is the way 
in which reinforcement can produce not only behavior that the 
experimenter intends but also behavior of which he has no 
advance idea. Suppose a timer is arranged to deliver a reinforcer 
every 30 seconds regardless of what the subject does. (This sched- 
ule is not the same as a fixed-interval schedule, where he has 
to make a particular response after the interval is up in order to 
be reinforced.) Whatever the individual is doing when the rein- 
forcement comes is more likely to occur again the next time. 
Purely on the basis of chance, he is more likely to be doing some- 
thing at that moment which he commonly does than something 
that he does more rarely. Given the fact that this behavior occurs 
commonly to start with, plus the fact that it has now just been re- 
inforced, it is all the more likely to be occurring when the next 
reinforcer is delivered. This reinforcement will strengthen it 
some more and make it even more likely to occur at the right time 
to receive the third reinforcement. Thus this particular behavior 
becomes more and more likely to occur because it is reinforced, 
even though the experimenter did not deliberately reinforce that
response rather than others. Rather, the learning was the result of a 
vicious circle: because the response occurred frequently, it was 
reinforced, and because it was reinforced, it occurred more frequently.
The experimenter did not know in advance which of the
various responses that the individual made frequently would be 
learned in this way; that selection depended on chance. There might 
be a period during which several different responses were 
reinforced before any one gained enough of a lead to start the 
vicious circle going. Behavior might be too variable for any one
response ever to get the necessary head start. However, when it
occurs, this unplanned reinforcing effect is an impressive
demonstration of the automatic operation of reinforcement.
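
The vicious circle described above can be sketched as a small simulation. This is only an illustrative sketch, not a model taken from Skinner: the function name simulate_superstition, the behavior names, the additive boost, and the assumption that the behavior in progress is sampled in proportion to its current strength are all hypothetical choices made for the example.

```python
import random


def simulate_superstition(base_rates, n_reinforcers=200, boost=0.5, seed=0):
    """Deliver reinforcers on a timer, regardless of what the subject does.

    base_rates maps each behavior to its starting strength (hypothetical
    units). Whatever behavior happens to be in progress when a reinforcer
    arrives is strengthened by `boost`.
    """
    rng = random.Random(seed)
    strengths = dict(base_rates)  # copy, so the caller's dict is untouched
    behaviors = list(strengths)
    for _ in range(n_reinforcers):
        # Assumption: the chance that a behavior is occurring at delivery
        # time is proportional to its current strength.
        weights = [strengths[b] for b in behaviors]
        current = rng.choices(behaviors, weights=weights, k=1)[0]
        # The reinforcer strengthens that behavior, which makes it more
        # likely to be the one in progress at the next delivery.
        strengths[current] += boost
    return strengths


start = {"pecking": 3.0, "turning": 2.0, "wing flapping": 1.0}
final = simulate_superstition(start)
shares = {b: s / sum(final.values()) for b, s in final.items()}
```

Running the sketch with different seeds gives different final shares, which corresponds to the point that the experimenter cannot know in advance which response will be selected. Under this additive rule a single response does not always pull far ahead, much as behavior in the text may be too variable for any one response to get a head start.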

Skinner refers to this kind of unplanned learning through
“accidental” reinforcement as superstitious behavior. The
justification for this term is that the subject acts as though a
certain behavior produced reinforcement, when in fact there is no
necessary connection between the behavior and the reinforcement.
The response is commonly followed by reinforcement only because
both behavior and reinforcement occur frequently and hence often
occur at the same time. The most successful experimental
demonstrations of superstitious behavior have been with pigeons,
but applications to human learning are not difficult to see. If a
student carries a rabbit's foot into an examination for good luck,
and does well, this experience will make him more likely to carry
it into the next examination. Repeated successes
while carrying the rabbit's foot will make his adherence to the foot
as a source of luck stronger and stronger, even though it contributed
nothing to his success [...]. (We overlook the possibility that the
confidence it gave the student may have increased his [...] in the
examination.) Many of man's [...], not only in magic, but also in
[...] administrative techniques, probably depend on such superstitious
[...]. A public speaker who thinks he is successful because he
[...] three times on the [...] starting [...] recognized as
superstitious, [...] successful because he starts each lecture [...]
may be equally superstitious by Skinner’s definition. In humans,
such superstitions are probably oftener learned in the first place 
from other people than from chance occurrences. However, when 
people claim that their faith in them has been validated by experi- 
ence, this experience often follows the learning pattern that Skin- 
ner has described and illustrated. 


Some Applications 


As the reader has doubtless already guessed, Skinner has 
shown much interest in the application of learning principles to 
complex practical situations. He has written a book analyzing lan- 
guage as a system of operant responses (Skinner, 1957). He has 
pointed up the various forms of reinforcement used in political, 
social, and economic control (e.g., describing ordinary wages as 
fixed-interval and piece-work as fixed-ratio schedules for the control 
of economic behavior) (Skinner, 1953). Most ambitiously, he has 
described a utopian community called Walden II (in honor of 
Thoreau, who might or might not feel himself honored), in which 
the principles of learning are used to create a more ideal form of 
social organization (Skinner, 1948). Some critics regard these 
interpretations as highly speculative oversimplifications. They ask 
why Skinner, though refusing to make inferences from what a pigeon 
does to what is going on inside the pigeon, is quite willing to make 
inferences from what the pigeon does to what is going on in human 
social organization. It must be admitted that Skinner’s interpretations
sometimes run far beyond his data. Skinner is both an ingenious
experimenter and a prophet of the application of science to
human problems, but the gap between these two roles sometimes 
looms large. 

Two of Skinner’s applications, however, have gone far beyond 
the realm of speculation. One of these is the treatment of behavior 
disorders. Whether the problem is minor misbehavior on the part 
of a relatively normal child or a psychosis that has kept a patient 
hospitalized for many years, Skinner’s approach to treatment is quite 
straightforward. Rather than focusing on early childhood, current 
psychodynamics, or possible organic abnormality, Skinner simply 
asks what this person is doing that we don’t like and what we 
would like to have him do instead. Once we have decided this, we 
can proceed to extinguish the undesirable behaviors and reinforce
desirable ones. In other words, we can change the contingencies of 
reinforcement, the specific relationships according to which rein- 
forcement is contingent on one or another behavior. This approach, 
basically similar to Guthrie’s but with the additional element of 
reinforcement, forms the basis of various behavior-modification 
techniques of psychotherapy. 

The behavior modification approach may be illustrated by a 
case in which a nursery-school child was cured of excessive crying 
(Hart et al., 1964). The therapists (experimenters? educators?),
like true Skinnerians, began by differentiating the child’s crying into 
respondent (usually produced by physical pain and not influenced 
by the social situation) and operant (less directly influenced by 
specific stimulation, but occurring when an adult was nearby and 
involving frequent glances at the adult, apparently to see how he 
was reacting). No attempt was made to modify the respondent cry- 
ing, but all the nursery-school teachers were trained to recognize 
and to ignore the operant crying. The attention from teachers that 
had previously reinforced the operant crying was thus removed, and 
extinction began. That the reduction in operant crying was really the 
result of the extinction procedure is indicated by what happened 
when the teachers were instructed to begin again paying attention 
to the child when he cried—there was rapid relearning of the crying. 
A second extinction was then begun, and the operant crying was
virtually eliminated.


Similar techniques have been used in mental hospitals and in a
variety of other settings to deal with such diverse behaviors as
over- and undereating, hoarding of towels (by a mental patient),
inattention in school, and the extreme social withdrawal of autistic
children and of many psychotics. (For further details, see Part II
of Ulrich et al., 1966.) From the point of view of more traditional
psychotherapies, the behavior-modification approach is open to 
criticism as too superficial, with its effects lasting only as long as the 
new contingencies of reinforcement are in effect. However, as long
as the proponents of behavior modification can point to rapid,
measurable, and sometimes dramatically large changes in behavior,
they are not likely to be greatly troubled by this criticism. Though
it is too early to be sure, the changes in treatment of abnormal
behavior resulting from the behavior-modification movement may
well turn out to be as great as those produced earlier by the rise of 
psychoanalysis. 

Skinner’s approach to psychotherapy obviously has much in 
common with education. It is therefore not surprising that the other 
of Skinner’s applications that has had important practical effects is 
in the field of education. This is the study of programmed learning, 
first popularized through its use in teaching machines. Though 
Skinner was not the first to suggest this approach to teaching, he 
gave the idea much of its early impetus. His object was to treat 
classroom learning like any other situation in which certain be- 
havior, in this case largely verbal behavior, is to be shaped. The 
student must progress gradually from familiar to unfamiliar 
material, must be given an opportunity to learn the necessary dis- 
criminations, and must be reinforced. The classroom situation has 
many disadvantages from this point of view. A rate of progress 
appropriate for one student is too fast or too slow for another.
Opportunities for each individual to make the required responses 
are limited, and reinforcement is often greatly delayed. Individual 
tutoring could solve all of these problems, but in most cases this 
is out of the question except perhaps for occasional supplementary 
work. What, then, can be done to give students in school the same 
advantages that pigeons in boxes have? Skinner’s answer to this 
question was the teaching machine. 

The basic component of the machine is the program. This is a 
series of combined teaching and test items that carries the student 
gradually through the material to be learned. An item may or may 
not convey new information to the student, but in any case it 
calls for him to fill in a blank in a statement. He can then look at 
the correct answer. If it agrees with his answer, this agreement con- 
stitutes the reinforcement. If not, he can study the correct answer 
so as to increase his chance of being reinforced next time. How- 
ever, Skinner prefers to make the learning requirements so gradual 
that the learner rarely if ever does make mistakes. If this effort is 
successful, then on every item the student makes a correct response
and is reinforced, which in Skinner’s view is the best possible ar- 
rangement for learning. Individual differences are then reflected 
in the rate at which the student proceeds through the program. For 
a discussion of the logic and the advantages of this approach to
teaching, see Skinner, 1961. 

At this point the reader may be wondering, “Where does the 
machine come in? This is just the old, familiar workbook method.” 
To some extent this is a valid point. Much current programmed 
instruction is in workbook rather than machine form. The student 
fills in a blank, then turns to another page to check his answer. 
However, these programmed workbooks differ from the more famil- 
iar type of workbook in that they do all teaching through the items 
in the program, rather than serving to supplement lectures and 
textbooks. The same series of items that calls forth the student's 
responses also provides the information necessary for making the 
responses. This arrangement, in turn, forces the author of the pro- 
gram to plan the sequence of items very carefully in terms of just 
what he wants the student to learn and just how it can best be pre- 
sented. Whether the program is in a workbook or in a machine is 
of secondary importance. Machines have some advantages in 
speeding up the reinforcement and reducing the likelihood of 
cheating by the student. It is likely, however, that much of Skin- 
ner’s preference for machine presentation results from two fac- 
tors: (1) the novelty effect of the machine, which very likely makes 
its use more reinforcing to the student, and (2) Skinner's personal 
liking for mechanized procedures. Although the machine itself has 
sometimes been a bone of contention between those who favor in- 
creased classroom efficiency and those who fear the loss of more 
personal values in education, this is a misplaced emphasis. The per- 
tinent question is about the value of programmed methods of in- 
struction in whatever form, not about teaching machines as such. 

It is too soon to judge how valuable programmed teaching, by
machine or otherwise, will prove in the whole context of education.
Many factors besides efficiency as a specific teaching device
are involved. Moreover, the increasing availability of computers
provides for more varied forms of presentation than either workbook
programs or the original form of teaching machine. Although
Skinner favors linear programs, in which every learner goes through
the same sequence of steps, which are easy enough so that mistakes
are rare, computers have increased the use of branching programs,
in which the learner’s answers determine what material he will
receive next. The technology of programmed instruction has now
gone far beyond Skinner’s initial contribution, but it continues to 
provide striking evidence that Skinner’s applications are not 
restricted to tricks of animal training and to speculations about 
the organization of society. 
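
The difference between linear and branching programs can be made concrete with a short sketch. It is a hypothetical illustration only: the Frame class, the run_program function, and the sample items (built on the wage and piece-work example mentioned earlier) are assumptions made for this example and are not drawn from any actual teaching machine.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class Frame:
    prompt: str                         # statement with a blank to fill in
    answer: str                         # the correct response
    remedial: Optional["Frame"] = None  # extra item used only when branching


def run_program(frames, answers, branching=False):
    """Step through the frames and log (prompt, reinforced) pairs.

    In a linear program every learner meets the same frames in the same
    order. In a branching program a wrong answer routes the learner to a
    remedial frame before he goes on.
    """
    log = []
    answers = iter(answers)
    for frame in frames:
        correct = next(answers) == frame.answer
        # Agreement with the correct answer is the reinforcement.
        log.append((frame.prompt, correct))
        if branching and not correct and frame.remedial is not None:
            extra = frame.remedial
            log.append((extra.prompt, next(answers) == extra.answer))
    return log


FRAMES = [
    Frame(
        "Piece-work pay is a fixed-____ schedule.",
        "ratio",
        remedial=Frame(
            "Pay per item depends on the number of responses made, "
            "so it is a fixed-____ schedule.",
            "ratio",
        ),
    ),
    Frame("Ordinary wages are a fixed-____ schedule.", "interval"),
]

linear_log = run_program(FRAMES, ["interval", "interval"])
branching_log = run_program(FRAMES, ["interval", "ratio", "interval"], branching=True)
```

In the linear run the learner who misses the first item simply goes on to the second; in the branching run the same mistake brings up the remedial item first, so the sequence of material depends on his answers.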


Skinner's Relation to Other Psychologists 


It is interesting at this point to notice Skinner’s relation to both 
Watson and Guthrie. He differs from both, of course, on the issue 
of reinforcement. He resembles them, however, in his practical 
emphasis. Like Guthrie, he is more at home analyzing particular
situations of practical interest than discussing abstract issues of 
general theoretical significance. In analyzing a response, Guthrie's 
first question is, “What stimuli evoke it?” while Skinner’s is, 
“What reinforcer sustains it?” Both of these, however, focus the 
emphasis on a specific, manipulable detail of the situation. Guthrie 
has gone farther in relating specific situations to a general state- 
ment of what learning is, while Skinner has gone farther in actually 
experimenting with the situations he analyzes. Their differences 
are substantial, but their similarities are nonetheless noteworthy. 

As for Watson, he and Skinner have in common a missionary 
zeal about what psychology should be and what it should con- 
tribute to human affairs. Both react vigorously against what they 
regard as vague, overtheoretical interpretations of human nature 
and in favor of strictly scientific study of behavior. Both present 
systems that their friends regard as highly useful and their foes re- 
gard as highly oversimplified. (Both judgments may, of course, 
be valid, for oversimplifications can be useful for many purposes.)
Both appear as prophets, seeking to purge the errors from
psychology as it is and to proclaim the glories of psychology as
it should be. Both have a vision of what man can become if guided 
by the proper application of the principles of learning. 

Though Skinner has gone farther than either Watson or Guth- 
rie in putting his ideas to the experimental test, he is still open 
to criticism for being too narrow in his interests, for concentrating 
on the application of a few principles rather than on a more gen- 
eral understanding of behavior. For example, he has concentrated 
on rate of emission of simple, free operants practically to the
exclusion of other response measures. He has used as reinforcers
whatever worked, without asking what it is that makes a reinforcer 
reinforcing. He has refused to consider what it is inside the organ- 
ism that makes possible all the complex learnings of which the hu- 
man (and even the lower animal) is capable. To Skinner himself, 
these statements are not criticisms but compliments; they indicate 
how far he has gone in clearing away nonessentials and getting 
down to business. To many others, however, they imply that Skin- 
ner has failed to provide the sort of understanding that is one of the 
functions of theory. Whether Skinner is to be praised or condemned 
for these features of his work is a matter of one’s own values, but 
at any rate the disagreement suggests that there is a need for other 
approaches. 

Skinner has also been criticized on another basis. His experi- 
ments have typically been conducted on one or a very few sub- 
jects. Skinner believes that only by looking at the behavior of a sin- 
gle individual can one find the lawfulness in behavior. Many psy- 
chologists, however, take the contrary view: that stable, general 
laws can be obtained only by averaging the behavior of many
individuals. Only thus, they say, can individual differences and
accidental fluctuations be ruled out so that the widely applicable,
general laws remain. The arguments on both sides of this question are
too complex to consider here, but they point up another reason why 
many psychologists are dissatisfied with Skinner’s system. They 
fear that Skinner substitutes ingenuity as a teacher for thorough- 
ness as a scientist, and that he magnifies peculiarities of individuals 
into what he claims are general laws of behavior. Again we see, not 
necessarily a weakness in Skinner, but a reason why other psycholo- 
gists have chosen other paths. 

As an experimenter, Skinner has made important contributions 
to the psychology of learning. Free operant behavior, schedules 
of reinforcement, shaping, superstitions, and the Skinner box 
are among these contributions. He shows little interest in being a 
theorist, important or otherwise. Nevertheless, he has made con- 
tributions in this area also. We may note, for example, the distinc- 
tion between respondent and operant behavior and the emphasis 
on precise control of behavior in the individual case. Not all of these
contributions are regarded favorably by everyone, as we have seen.
There is, however, no question about the importance of Skinner’s
contributions to both the pure and the applied psychology of
learning.


MILLER’S INTERPRETATION OF LEARNING


We turn now to another connectionist reinforcement position, 
that of Neal Miller (b. 1909), who was a professor of psychology at 
Yale through most of his career, but who moved to Rockefeller 
University in 1966. In principle Miller’s system is not very different
from Skinner’s. It does, however, differ considerably in both 
vocabulary and experimental techniques. It is also more theoretical 
than Skinner’s. Actually it represents to a great extent a simplifica- 
tion of the theory of Clark Hull, which will be considered in detail 
later. Treating Miller before Hull, though it reverses the natural 
chronological sequence, may serve as an introduction to the highly 
technical study of Hull’s system. 

Much of Miller’s importance as a theorist comes from the ap- 
plications of his theory to topics in personality, social, and abnor- 
mal psychology. These applications were worked out jointly by 
Miller and John Dollard (b. 1900). Miller’s background in exper- 
imental psychology and Dollard’s in clinical psychology and the 
social sciences made possible a very fruitful collaboration on these 
topics. The theoretical analysis of learning is primarily Miller’s, 
with a heavy debt to Hull, but the applications are largely Dollard’s. 


The Four Elements of Learning 


Whereas Skinner makes the concept of reinforcement central 
to his interpretation of learning, Miller does the same with drive. 
A drive is an aroused state of the organism, one that goads the in- 
dividual into action. For Miller, a drive always involves a strong 
stimulus; moreover, any stimulus, if strong enough, acts as a 
drive. The drive stimulus may be either external or internal. Pain is
an example of a drive produced by an external stimulus, while
hunger and thirst are drives produced by internal stimuli. Some 
drives are produced by stimuli from an individual’s own emo- 
tional responses. When we are angry or afraid, physiological 
changes take place in our bodies. Some of these produce strong 
internal stimuli, which are responsible for the drives of anger and 
fear. Whatever its source, a drive arouses the individual and keeps
him active. Drive is thus the basis of motivation. 

In an infant, or in an older individual in a highly unfamiliar sit- 
uation, the activity produced by the drive has little direction. 
All sorts of different responses are made. It may happen that one 
of these responses serves to reduce the strength of the drive. If 
the drive is hunger, eating food will reduce it. If the drive is pain 
from electric shock, moving away from the electrically charged 
conductor will reduce it. When the drive is reduced (which often, 
but not always, means that it is completely removed), the individ- 
ual becomes less active. Thus one thing we say about drive is that 
a strong stimulus increases activity and the removal of the strong 
stimulus decreases activity. 

So far this is only a statement about motivation, not about 
learning. However, a reduction in the strength of a drive has a 
very important property: it reinforces whatever response came 
just before. Thus whatever response serves to reduce the drive is 
reinforced and therefore tends to be learned. As a result, the be- 
havior produced by drive in a familiar situation is quite different 
from the “trial and error” in an unfamiliar situation. The individ- 
ual now quickly makes the learned response that has reduced the 
drive in the past and thus immediately reduces it again. Drive re- 
duction is thus the basic operation in learning. 

So far we have a drive, a response, and a resultant reduction 
in drive. One more element needs to be added: the collection of 
stimuli that guides the response. One does not simply learn a re- 
sponse to reduce a given drive whenever or wherever the drive 
occurs; the response depends on conditions. Under some condi- 
tions one reduces hunger by going to a restaurant, under others by 
cooking a meal. The stimuli that guide the response and that deter- 
mine which response will occur are known as cues. Finally, the 
drive reduction that ends the learning sequence may of course be 
called a reward. This is the fourth of the elements in learning that 
Miller and Dollard have made famous: drive, cue, response, and 
reward. In Thorndike’s puzzle box, for example, the drive was 
hunger (plus, very likely, exploration), the most important cue 
was the loop of string, the response was pulling the string, and
the reward was escape and food.

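
The four elements can be put together in a brief sketch of trial-and-error learning by drive reduction. It is a loose illustration under stated assumptions, not Miller and Dollard's own formulation: the function run_trials, the numerical habit strengths, the step size, and the rule that responses are tried in proportion to their strength are all hypothetical.

```python
import random


def run_trials(cue, responses, reduces_drive, n_trials=30, step=1.0, seed=0):
    """Trial-and-error learning with drive, cue, response, and reward.

    On each trial the drive persists, and responses keep being tried,
    until one of them reduces the drive. That drive reduction (the reward)
    strengthens the response that came just before it.
    """
    rng = random.Random(seed)
    habit = {(cue, r): 1.0 for r in responses}  # hypothetical starting strengths
    attempts_per_trial = []
    for _ in range(n_trials):
        attempts = 0
        while True:
            weights = [habit[(cue, r)] for r in responses]
            response = rng.choices(responses, weights=weights, k=1)[0]
            attempts += 1
            if reduces_drive(cue, response):
                habit[(cue, response)] += step  # reward strengthens the habit
                break
        attempts_per_trial.append(attempts)
    return habit, attempts_per_trial


# Thorndike's puzzle box, in the terms used above: hunger is the drive,
# the loop of string is the cue, and only pulling the string is rewarded.
RESPONSES = ["pull string", "scratch at bars", "push at door"]
habit, attempts = run_trials(
    "loop of string",
    RESPONSES,
    reduces_drive=lambda cue, response: response == "pull string",
)
```

Because only the rewarded response gains strength, the number of attempts per trial tends to fall as training goes on, which corresponds to the change from varied activity in an unfamiliar situation to a quick learned response in a familiar one.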
This position might be regarded as a combination of Guthrie
on the one hand and Thorndike and Skinner on the other. A strong 
(drive) stimulus acts as a maintaining stimulus, in Guthrie’s 
terms. Its removal, however, acts as a reinforcer. Miller has used 
the term “drive” to include what Skinner would consider three dif- 
ferent concepts. One of these is the negative reinforcer. Miller’s def- 
inition of a drive is close to Skinner’s definition of a negative rein- 
forcer, since both are noxious and both provide reinforcement by 
their removal. The second concept is that of the maintenance 
schedule. The effectiveness of food as a reinforcer depends on the 
subject’s recent feeding schedule; if he has been deprived of food, 
food will be more reinforcing than if he has not. Maintenance, for 
Miller, is another aspect of drive. The third concept is that of 
arousal. A number of stimuli will produce the respondent behaviors,
such as acceleration of the heart and tensing of the muscles,
that we call physiological arousal. For Miller, arousal is still an- 
other aspect of drive. Thus Miller has either advanced theory or 
confused the issue, depending on your point of view, by treating 
such diverse entities as hunger, pain, and emotional excitement 
under the single heading drive.


Imitation


In their first book, Social Learning and Imitation, Miller and
Dollard state their basic interpretation [...]

[...] is not rewarded and starts to extinguish. Thus imitative behavior
is rewarded and other behavior not, so that the individual learns 
to do what he sees the other do. 

Miller and Dollard give an example of two little boys, aged six 
and three, playing a game with their father. The father hid a piece 
of candy for each and then the two boys looked for them. Wherever
the older boy looked, his younger brother tagged along and
looked also. When the older boy found his candy and stopped 
looking, the little one had no idea where to look next. The only 
strategy he knew in this game was to imitate his brother exactly. 

How had this pattern of imitation, so self-defeating in this 
case, been built up? Miller and Dollard refer to an earlier incident 
in the life of these brothers. On one occasion, the older boy heard 
his father’s step and ran to greet the father as he came home from 
work. The younger boy happened to be running in the same di- 
rection at the same time. The father met both boys with presents of 
candy. Since the younger boy had not learned to distinguish his 
father’s step, he was not running to greet his father; the fact that 
he was running was purely coincidental. Nevertheless, he was re- 
warded for the running. Naturally, there had been many times in 
the past when he had run and not been rewarded for it. The cue 
that made this time distinctive was the sight of his brother run- 
ning. Thus there was a learning situation for the younger boy, with 
hunger as the drive, the sight of his brother running as the cue, 
running as the response, and candy as the reward. It was through 
this incident and many others like it, Miller and Dollard decided, 
that the little boy eventually learned to imitate his brother. Hav- 
ing often been rewarded for imitating his brother, he did so even 
in situations such as the hiding game where it was useless. 

It is easy to think of such situations in which people are re- 
warded for imitating others. Miller and Dollard were not satisfied, 
however, merely to cite such examples. They went on to do ex- 
periments in which both humans and animals were taught to imi- 
tate. In one such experiment, the subject (a first-grade child) could 
get candy from a machine sometimes by turning the handle and 
sometimes by pressing it down. If the child made the wrong mo- 
tion, he got no candy on that trial. Paired with each child was 
another person, who took a turn at the machine just before the 
subject did. Sometimes this other person was another child and
sometimes an adult. If the adult turned the handle, turning the 
handle would also be correct for the child. If the other child 
turned the handle, however, only pressing the handle would work 
for the subject. In other words, in this situation the children were 
rewarded for imitating adults but not for imitating other chil- 
dren. The children not only learned to imitate the adult and not 
the other child, they also generalized this behavior to other adults 
and other children. Some other children were rewarded for imitat- 
ing children but not adults, and they too learned. This one experi- 
ment illustrates not only the learning of imitation, but also gen- 
eralization of imitation from one person to another and discrimi- 
nation between people to be imitated and people not to be. 
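
The logic of this experiment can be sketched in a few lines. The sketch is a deliberately crude illustration, not a reconstruction of Miller and Dollard's procedure or data: the ImitationLearner class, the unit changes in strength, and the rule that the stronger response is always chosen are hypothetical assumptions. Strengths are attached to the class of model (adult or child) rather than to a particular person, which is how the sketch represents generalization.

```python
from collections import defaultdict

RESPONSES = ("imitate", "do_otherwise")


class ImitationLearner:
    def __init__(self):
        # strength[(model_class, response)], starting at zero
        self.strength = defaultdict(float)

    def choose(self, model_class):
        # max() returns the first maximal item, so ties go to "imitate"
        return max(RESPONSES, key=lambda r: self.strength[(model_class, r)])

    def trial(self, model_class, rewarded_response):
        response = self.choose(model_class)
        if response == rewarded_response:
            self.strength[(model_class, response)] += 1.0  # rewarded
        else:
            self.strength[(model_class, response)] -= 1.0  # not rewarded
        return response


learner = ImitationLearner()
for _ in range(5):
    learner.trial("adult", rewarded_response="imitate")
    learner.trial("child", rewarded_response="do_otherwise")

# A new adult or a new child is treated according to the class he belongs to.
with_new_adult = learner.choose("adult")
with_new_child = learner.choose("child")
```

After this training the learner imitates adult models and does the opposite of child models, including models it has not met before; reversing the rewarded responses would train the opposite discrimination, as with the children rewarded for imitating children but not adults.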

Miller and Dollard go on to apply these principles to a variety
of social situations. They point out, for example, that we learn 
to imitate high-prestige people more than those of low prestige. 
A child is likely to be rewarded for imitating “those nice (middle 
class) children” and not rewarded for acting like “those nasty 
(lower class) children from across the tracks.” The results of such 
learning to imitate high-prestige people may be seen in adolescent 
girls who try to dress and act like movie stars and in adult men 
who buy cars beyond their means in order to keep up with 
the Joneses. Miller and Dollard also apply the principles of imita- 
tion to the behavior of crowds. Mobs, they suggest, consist of 
people imitating one another and thus stimulating one another to 
deeds that few of the people would ever commit as isolated indi- 
viduals. This argument is illustrated with a gruesome story of a 
lynch mob. 


Fear and Neurosis 


In their second book, Personality and Psychotherapy, Dollard 
and Miller consider the learning of personality and particularly 
the learning and unlearning of neuroses. They begin by pointing 
out three characteristics of the neurotic person: that he is miser- 
able because of his conflicts, that he is stupid about certain as- 
pects of his life, and that he has symptoms. They then proceed to 
explain these three characteristics according to the neurotic’s pre- 
vious learning. The crucial element in this learning is the learned 
drive of fear. This drive is the basis of the conflict, the source of
the misery, and the cause of the stupidity. 

What is a learned drive? Skinner would call it a conditioned nox- 
ious stimulus. Dollard and Miller, however, go farther in analyz- 
ing what they think goes on when a drive is learned. They begin 
their discussion with an experimental demonstration involving 
rats. A rat is placed in a box with two compartments, one of which
has white walls and a grid floor, the other black walls and a 
wooden floor. The rat explores both parts and shows little preference
between them. Then it is placed in the white compartment and
given strong electric shock through the grid floor. Most rats soon 
escape the shock by running into the black compartment. This se- 
quence of shock in white and escape to black is repeated several 
times. Then the rat is placed in the white compartment without 
shock. It runs rapidly to the black compartment. Since there is no 
longer a pain drive to motivate this escape behavior, Dollard and 
Miller explain it on the ground that a secondary drive of fear has 
been conditioned to the cues of the white compartment. 

What is the nature of this secondary drive? Like all drives, say 
Dollard and Miller, it involves strong stimulation. These strong 
stimuli are produced by the rat’s own responses. When the rat was 
first shocked in the white compartment, it made a variety of re- 
sponses to the shock, such as tensing the muscles, increasing the 
heart rate, and other such indicators of emotion. (These are not, 
of course, voluntary responses in either rats or humans. Skinner 
would call them respondents.) These in turn produced strong stim- 
ulation which was added to the drive produced by the pain of 
the shock. These strong stimuli produced by the emotional re- 
sponses make up the drive of fear. These emotional responses be- 
came conditioned to the cues of the white compartment, ow- 
ing to the fact that they occurred in the presence of those cues and 
were followed by drive reduction when the rat escaped the shock. 
Now when the cues of the white compartment are presented without 
shock, they produce the emotional responses which in turn produce 
stimuli of the fear drive. 

The fact that fear is a drive can be demonstrated by using it 
as the basis of new learning. This demonstration was done by clos- 
ing the door from the white to the black section and making it 
possible for the rat to open the door only by turning a wheel on
the wall near the door. The animals learned to turn the wheel and 
then run into the black compartment, even though there was no 
shock in the white compartment. In this case fear was the drive, 
sight of the wheel the cue, turning the wheel the response, and 
escape from fear (by escaping from the white compartment) the 
reward. Then the situation was changed so that turning the wheel 
would not open the door but pressing a lever would. As a result, 
the wheel-turning response then extinguished and the lever- 
pressing response was acquired. Thus the secondary drive of fear 
operated like a primary drive to motivate learning. 

How does this kind of learning occur when we go from experi- 
mental rats to neurotic humans? Let us consider a child who is 
severely punished for any kind of self-assertive behavior. When- 
ever he tries to get his own way, he is subjected to pain, which pro- 
duces emotional responses, which produce the secondary drive of 
fear. These fear-producing responses become conditioned to the 
cues which are present at the time, including the cues that come 
from his own self-assertion. As a result, any self-assertive behavior 
comes to produce fear, while submissiveness reduces the fear. 
The child is then afraid of self-assertion in the same way that the 
rat is afraid of the white compartment, and the child's submissive 
behavior is an escape from this fear-provoking self-assertion just 
as the rat’s running to the black compartment is an escape from 
the fear-provoking white compartment. 

However, the child’s problem is worse than the rat’s. As long 
as the rat is able to run to the black compartment, his fear is brief 
and does not disrupt his life. The child, however, is often placed 
in positions where self-assertive behavior might get him things 
that he wants. These positions occur even more often after he be- 
comes an adult. The fact that he is afraid to be self-assertive is a 
great handicap in such situations. He is placed in a conflict be- 
tween his desire for something and his fear of the self-assertive be- 
havior that would get it for him. This conflict, in which he loses 
no matter what he does, is a source of misery. If he recognized the 
great difference between his present situation and that in which he 
was punished for self-assertion, he might be able to relieve the 
fear and resolve the conflict. However, this course requires that 
he recognize his problem and think about it. But he cannot do 
either since he has become afraid not only of behaving self- 
assertively but even of saying (to others or to himself) that 
he would like to be self-assertive. It is this fear of saying or even 
thinking that he would like to be self-assertive that makes his be- 
havior stupid. He cannot make the thinking responses that would 
help him to understand and to solve his problems. He can, how- 
ever, obtain some relief in a variety of ways. He may, for exam- 
ple, become so dependent, perhaps through symptoms of appar- 
ent physical illness, that others will feel obliged to take care of him 
and provide him with some of the things he wants. This would 
not be a deliberately adopted policy but a response learned 
through its drive-reducing effects. The “illness” would be called a 
symptom. It is a partial solution to the problem posed by fear and 
conflict, but it is only a partial solution, and it interferes with find- 
ing a more effective solution. This individual has all the character- 
istics of the neurotic; he is miserable, in conflict, stupid about his 
troubles, and has symptoms. 

How can the neurosis be eliminated through psychotherapy? 
Since fear is the crucial cause, extinguishing fear is the crucial 
element of the cure. If the rat is given enough experience in the 
white compartment without shock, its tendency to make the fear- 
producing responses will eventually extinguish. Similarly, if the 
neurotic can be persuaded to make self-assertive responses (or do 
whatever else it is that he is afraid of) under conditions where he 
will not be punished, his fear will extinguish. Since the cues for 
fear come from the individual’s own responses, he must gradually 
be induced to make these responses, first in very weak and in- 
direct form, later more directly and strongly. Thus in the early 
stages of therapy the neurotic may timidly say that he sometimes 
thinks he could make helpful suggestions to his boss, while in the 
later stages he may express a violent desire to tell the boss off with 
assorted insults. As the patient extinguishes his fear of making 
self-assertive statements, he becomes more able to think sensi- 
bly about his conflict. Since talking about an act and doing it are 
somewhat similar, he also becomes by stimulus generalization 
less afraid of overt self-assertive behavior. Thus Dollard and 
Miller, through a considerably different theoretical system, come to 
a practical conclusion very much like Guthrie’s threshold method 
for the elimination of undesirable emotional habits. 


SOME RELATIONS AMONG INTERPRETATIONS 


We have now considered several theories of learning, all of 
which are parts of the behavioristic tradition in the broad sense. 
They have two things in common. One is that they are connec- 
tionist theories, concerned with the connections of stimuli with 
responses. (Skinner is to some extent an exception, but even though 
he does not regard stimuli as eliciting operant behavior, he does 
regard stimuli as important in controlling it.) The other similarity is 
that all are of a relatively simple, informal sort. This quality does 
not necessarily mean that they are easy to understand. It does 
mean, however, that they make little use of formal definitions, ab- 
stract symbols, or quantitative equations. They are stated in ordi- 
nary language, with few if any technical terms, rather than in the 
special languages of mathematics or symbolic logic. These two 
characteristics set them apart from the other theories that we will 
consider in later chapters. 

In what ways do these theories differ from one another? One 
split is between the contiguity theorists (Watson and Guthrie) and 
the reinforcement theorists (Thorndike, Skinner, and Miller). 
However, this difference is not as clearcut as it may seem. When 
Guthrie says that we learn responses which take us away from 
the maintaining stimuli, and Miller says that we learn responses 
which reduce strong stimulation, it is hard to see much difference 
between them. Guthrie does not, however, say that changes in 
maintaining stimuli are the only stimulus changes that can result 
in learning, so there is still some difference between him and 
Miller. We might think of Guthrie’s contiguity principle and Mil- 
ler’s drive-reduction principle as two different (but not com- 
pletely different) attempts to explain why we tend to learn those 
responses that are rewarded. Thorndike and Skinner, on the other 
hand, are not concerned with such explanations; they simply ac- 
cept the principle that rewards are reinforcing and go on from 
there. Miller concludes that all learning requires reward; Guthrie 
concludes that reward is probably not thus required, and that any- 
way it is more worthwhile to talk about contiguity than about 
reinforcement. 

Another difference among these theories is in the extent to 
which they make statements about what is going on inside 
the body. Watson, Guthrie, and Miller explain all sorts of things 
in terms of the internal responses that individuals make and the 
stimuli that these responses produce. Thorndike and Skinner make 
little use of such explanations; Skinner in fact argues that all such 
explanations are inappropriate. The borderline is again difficult to 
draw, however. Skinner is quite willing to consider verbal responses, 
even when they are very quiet, and emotional respondent behav- 
ior, even when instruments are required to record it. Watson, Guth- 
rie, and Miller consider the stimulus-producing responses that 
they talk about just as physically real as those that Skinner will 
admit. The difference is thus only that Skinner will not talk about 
these responses unless he can point to them, whereas Watson, 
Guthrie, and Miller are willing to assume that they are there even 
when they cannot say exactly where they are and what they look 
like. 

On the whole, the similarities among these interpretations of 
learning are more marked than their differences. This will 
be more apparent when we consider a markedly different group 
of interpretations—the cognitive ones. It is to these that we turn 
in the next chapter. 


COGNITIVE INTERPRETATIONS 
OF LEARNING 


ONE YEAR BEFORE Watson published his first challenge to Amer- 
ican psychology, Max Wertheimer (1880-1943) published a chal- 
lenge to the established psychology of Germany. The orthodoxies 
against which these two revolts were directed were, as we noted in 
Chapter 2, much alike. Both the American and the German ver- 
sions were largely concerned with the structure of the mind. 
They tried to analyze conscious thought into its fundamental 
units, such as sensations, images, and ideas. Particularly in Amer- 
ica, there was some trend toward studying behavior for its own 
sake, but psychology was still regarded as primarily the study of 
conscious experience. Experimentation was directed toward a 
more complete analysis of the contents of consciousness. 

The forms which these two revolts took, however, were strik- 
ingly different. Watson’s objection was that psychology should 
not be concerned with consciousness, but with behavior. He 
wanted to abolish the discussion of images and ideas in favor of 
a discussion of stimuli and responses. He still agreed with the ear- 
lier position, however, in being interested in analysis. He still wanted 
to work with fundamental units, though they were now units of be- 
havior instead of units of consciousness. 

Wertheimer, on the other hand, objected to the concern with 
analysis. It seemed to him that breaking consciousness into its parts 
destroyed what was most meaningful about it. He had none of 
Watson’s objection to the study of consciousness; indeed con- 
sciousness was his main concern. What he wanted to do was to 
study consciousness as it appears in wholes, rather than break it 
down into parts. To the traditional psychology, anything we look 
at is a mosaic of tiny patches of color. Only as all of these 
tiny patches are put together do they make up the scene we ob- 
serve. Wertheimer challenged this view. We actually see the scene, 
he insisted, as a meaningful whole. Only by a very artificial proc- 
ess of analysis can we break down this whole into patches of dif- 
ferent colors and shades. The same applies to thinking. The tradi- 
tional psychology regarded all our thoughts as made up of im- 
ages connected by a process of association. This breakdown, too, 
Wertheimer rejected. Our thoughts are whole meaningful percep- 
tions, not associated collections of images. 

Wertheimer’s first publication in this revolt against analysis 
was concerned with the phenomenon of apparent movement. It is 
well known that we see a light as moving from one place to an- 
other when in fact what has happened is that a light in one place 
was turned off and one in another place immediately turned on. 
This illusion is the basis of the apparent movement in lighted ad- 
vertising signs. Prior to Wertheimer, this phenomenon had been 
regarded as a minor curiosity of no theoretical importance. To 
Wertheimer, however, it was striking evidence of the futility of 
analyzing a whole into its parts. The components were two sepa- 
rate lights going on and off, but the resulting whole was an impres- 
sion of movement. The observer does not see the two lights flash- 
ing and infer that something is moving; the impression of move- 
ment is immediate and direct. This phenomenon of apparent 
movement so impressed Wertheimer that he named it the phi phe- 
nomenon and began a series of studies on it. 


EARLY GESTALT PSYCHOLOGY 


The phi phenomenon was only the starting point of an intel- 
lectual movement within German psychology. This movement 
was primarily concerned with perception, but came to include 
learning and other topics as well. Its emphasis was on whole sys- 
tems in which the parts are dynamically interrelated in such a way 
that the whole cannot be inferred from the parts taken separately. 
Wertheimer applied the German word Gestalt, which may be 
roughly translated as “form” or “pattern” or “configuration,” to 
these dynamic wholes. Such Gestalten (the plural of Gestalt) are of 
many sorts, and they occur in physics as well as in psychology. We 
have already considered the phi phenomenon as one example. A 
melody is another, since it depends on the relation between the 
notes rather than the notes themselves. “Die Lorelei” (to take an 
appropriately German example) is still the same tune when trans- 
posed into another key so that every note is different. A whirlpool 
is a third example, since it is a whirlpool not because of the par- 
ticular drops of water it contains but because of the way the mo- 
tion of the water is patterned. Because of this concern with Gestalt- 
en, the movement that Wertheimer started came to be known 
as Gestalt psychology. 

The emphasis of the gestalt (as an English word it needs no 
capitalization) psychologists on unified wholes does not mean 
that they never recognized separateness. Indeed, a gestalt may 
be referred to as a segregated whole. Of particular interest was the 
way that gestalten come to stand out as distinct entities separate 
from the background against which they appear. This interest was 
expressed in the concepts of figure and ground. The figure in any 
perception is the gestalt, the entity that stands out, the “thing” 
we perceive. The ground is the largely undifferentiated back- 
ground against which the figure appears. A melody, for example, 
is a figure against a ground that includes many other sounds. 
What appears as figure at one moment may not at another. If the 
listener stops paying attention to the melody in order to hear what 
his friend is saying, his friend’s speech becomes figure and 
the melody becomes part of the ground. Such changes in figure- 
ground relationships play a part not only in perception but in 
learning and thinking as well. 

It is of course possible to analyze a gestalt figure into compo- 
nent parts. The fact that three black dots on a white page appear as 
a triangle does not keep them from still being three dots. However, 
the important thing to a gestalt psychologist is that what we see 
immediately is a triangle. Afterward we can analyze the triangle 
into three dots and study what it is that makes these three dots ap- 
pear as a triangle when another three, differently placed, do not. 
We cannot say, however, that the triangle is nothing but three dots. 
The triangularity, which depends on the pattern of the dots rather 
than on the dots themselves, is the most essential aspect of what 
we see. The gestalt figure is more than just the sum of three dots. 
This relationship is the basis of one of the watchwords of gestalt 
psychology: “the whole is more than the sum of its parts.” 

It is evident that Wertheimer and Watson, though rebels against 
similar traditions at the same time, were moving in opposite di- 
rections. Each may be considered both a pioneer and an extreme 
prototype of a certain approach to psychology. Watson’s was the 
mechanistic approach, concerned with the components of behav- 
ior and the connections between them. Wertheimer’s was the dy- 
namic approach, concerned with unified patterns in consciousness. 
Without judging their importance relative to that of other men, 
we can say that in terms of conspicuous intellectual movements, 
Watson was the outstanding pioneer of connectionist theory and 
Wertheimer the outstanding pioneer of cognitive theory. 

From the first, however, Wertheimer shared the spotlight with 
two of his colleagues, Wolfgang Köhler (1887-1967) and Kurt Koffka 
(1886-1941), both of whom eventually surpassed Wertheimer 
as publicists for the new movement. These two men wrote a num- 
ber of books on different aspects of gestalt theory. Together with 
Wertheimer, they formed the nucleus of a group that became 
known as the Berlin school. All three, however, eventually moved 
to the United States. Perception remained their primary interest, 
but learning was by no means neglected. Throughout, the emphasis 
was on organized wholes, separated from other wholes but united 
within themselves by their dynamic patterning. 

The interpretations of learning presented by Wertheimer, Köh- 
ler, and Koffka tend to be presented in the terminology of percep- 
tion. Instead of asking, “What has the individual learned to do?” 
the gestalt psychologist is likely to ask, “How has he learned to 
perceive the situation?” Gestalt interpretations thus make an in- 
teresting contrast with the connectionist interpretations we have 
discussed earlier. It is true that gestalt theorists speak of memory 
traces, which are the effects that experiences leave in the nervous 
system. However, these are different from the stimulus-response 
bonds discussed by connectionist theorists. The memory traces of 
the gestalt psychologists are not isolated elements, but organized 
wholes, in other words, gestalten. Consequently, learning is not 
primarily a matter of adding new traces and subtracting old ones, 
but of changing one gestalt into another. This change may oc- 
cur through new experience, but it may also occur through think- 
ing or through the mere passage of time. The way in which these 
restructurings occur is the concern of gestalt learning theory. 


Studies of Insight 


The most important contribution of gestalt theory to our un- 
derstanding of learning is in the study of insight. Often learning 
occurs suddenly with a feeling that now one really understands. 
Such learning is likely to be especially resistant to forgetting and 
especially easy to transfer to new situations. We speak of 
such learning as involving insight. In such cases the gestalt lan- 
guage of perceptual reorganization is particularly applicable. The 
learner who has insight sees the whole situation in a new way, a 
way which includes understanding of logical relationships or per- 
ception of the connections between means and ends. 

Such insight is by no means restricted to humans. During 
World War I Köhler, technically interned in the Canary Islands, 
did extensive studies of insightful problem solving in apes. These 
are described in his book The Mentality of Apes. He presented 
apes with problems in which bananas were displayed out of reach 
and could be obtained only by using techniques new in the apes’ 
experience. For example, a banana might be hung from the top 
of the animal’s cage, with boxes elsewhere in the cage which could 
be piled under the banana so that the ape could climb up and get 
it. Or a banana might be outside the cage, far enough away so that 
it could be obtained only by pulling it in with a stick. Such ar- 
rangements had the advantage, from the gestalt point of view, of 
making all the necessary elements of the solution visible to the an- 
imal, which is not the case in a puzzle box or maze. He found that 
these problems not only were often solved suddenly, but fre- 
quently were solved immediately after a period of time during 
which the ape was not actively trying to reach the banana. Some- 
times it appeared that the animal, having failed to obtain the ba- 
nana by familiar methods, sat and thought about the problem and 
then suddenly saw the solution. Such incidents are well suited to 
a description in terms of perceptual restructuring. Köhler could 
say that an ape suddenly saw the boxes, for example, not as playthings 
to be tossed around but as supports to be climbed on. It 
saw the relationship between the boxes and the bananas. During 
the time that it was not actively doing anything about the prob- 
lem, the ape was undergoing a process of restructuring which, 
when complete, made an immediate solution possible. 

It should not be supposed, however, that only such dramatic 
examples of sudden and complete insight can be explained in ge- 
stalt terms. Gradual learning by trial and error can be interpreted 
as a series of small, partial insights. Köhler’s problems were so 
arranged that the ape could see all the necessary elements of the 
solution at once. All that was necessary was for these parts to 
become organized into an appropriate gestalt. A rat in a maze, on 
the other hand, cannot see any relation between the pattern of 
turns and the food at the end until this relation has been discov- 
ered by experience. The rat’s restructuring must therefore be 
gradual and piecemeal, since the situation permits no other kind. 
Nevertheless, the discovery that a certain pattern of turns is the 
way to food is no less a cognitive restructuring than the discovery 
that boxes piled on one another are the way to food. The sudden- 
ness of the restructuring depends on the problem and the way it is 
presented to the subject, but the principle is the same. 


Gestalt Laws of Learning and Forgetting 


Insight requires that certain aspects of a situation be seen in 
relation to one another, that they appear as a single gestalt. What 
factors determine whether this event will occur? In his book, 
Principles of Gestalt Psychology, Koffka pointed out that the same 
principles could be used to answer this question both in complex 
problem-solving situations and in very simple perceptual situa- 
tions. He suggested, therefore, that certain laws of perception pro- 
posed by Wertheimer should also be taken as laws of learning. 
We will consider two of these: the law of proximity and the law 
of closure. 

Fig. 1. The laws of proximity and closure. The parallel lines in part A 
form three sets of two, because of their proximity. In part B they form 
two sets of two with an extra one on each end, because of the closed 
figures formed by the middle pairs of lines. 

The law of proximity, as applied to perception, refers to the 
way in which items tend to form groups according to the way they 
are spaced, with the nearer ones being grouped together. For 
example, if a number of parallel lines are drawn on a sheet of paper, 
with alternate wide and narrow spaces between them, the pairs 
with narrow spaces between will be seen as groups of two (see 
Fig. 1, Part A). It is these that are seen together, rather than the 
pairs with wide spaces between, because of their closer proximity 
to one another. This law also applies to spacing in time. Sounds 
close together tend to be heard as units. International Morse 
Code takes advantage of this principle by using intervals of si- 
lence of different lengths to separate letters and words, thus mak- 
ing these groups of sounds stand out as units. As applied to learn- 
ing, the law of proximity may again refer to closeness either 
in space or in time. In reference to space, it would explain why it 
is easier for an ape to discover that he can reach a banana with a 
stick if the stick and the banana are on the same side of the cage. 
(The reader may think that this is really an example from percep- 
tion rather than learning, since the ape sees stick and banana as 
going together. If so, the reader has grasped part of the spirit of 
gestalt theory, in which perception and learning are inextricably 
tied up together.) In reference to time, it would explain why it is 
easier to remember recent events, which are closer to the present 
in time and hence more easily joined with the interests of the pres- 
ent in a common gestalt. 
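
The perceptual grouping described by the law of proximity can be 
pictured with a small sketch. It is only an illustration under stated 
assumptions, not a model proposed by the gestalt psychologists: the 
function name group_by_proximity, the line positions, and the max_gap 
threshold are all hypothetical. Items are joined into one group 
whenever the gap to the neighboring item is small enough. 

```python
def group_by_proximity(positions, max_gap):
    """Group 1-D positions; a gap wider than max_gap starts a new group.

    max_gap is an assumed threshold chosen for illustration only.
    """
    groups = []
    for p in sorted(positions):
        if groups and p - groups[-1][-1] <= max_gap:
            groups[-1].append(p)   # close to the previous item: same group
        else:
            groups.append([p])     # far from the previous item: new group
    return groups


# Six parallel lines with alternating narrow (1) and wide (3) spacing,
# loosely in the spirit of Fig. 1, Part A.
lines = [0, 1, 4, 5, 8, 9]
print(group_by_proximity(lines, max_gap=2))  # [[0, 1], [4, 5], [8, 9]]
```

With these assumed positions the lines fall into three sets of two, the 
narrowly spaced pairs. The same function applies unchanged to moments 
in time, such as the onsets of sounds. 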

The law of closure states that closed areas more readily form 
units. Applied to perception, this fact can be seen by referring to 
the previous example of parallel lines. It is possible to change the 
grouping and make the more widely separated pairs of lines ap- 
pear as groups. This change may be accomplished by connecting 
the ends of these lines so that they form two sides of a box. The 
connecting lines need not be complete; so long as the more widely 
separated pairs of lines appear to be parts of a figure that enclose 
space, they will tend to be seen together (Fig. 1, Part B). Applied 
to learning, the law of closure plays the [...]. However, the relation 
between [...] closure of a geometric figure [...] would be more accurate 
to say, not [...] closure in perception to learning, but that he has 
called a law about perception and a law about learning by the same name. 

It is interesting to note that, for all the enormous difference between 
Guthrie’s interpretation of learning and that of the gestalt 
psychologists, there is a certain similarity between Guthrie’s pri- 
mary law of conditioning and the gestalt psychologists’ law of 
closure. Both regard reward as producing its effect by the way 
that it changes the situation for the learner. For Guthrie, the re- 
ward changes the situation so that the last response which oc- 
curred remains conditioned to the stimuli of the situation. For the 
gestalt theorists, reward changes the individual’s perception of the 
situation so that the stimuli, the response, and the reward form a 
gestalt. From the point of view of a reinforcement theorist, this 
feature would justify classifying Guthrie’s theory and gestalt the- 
ory together as contiguity (as opposed to reinforcement) theories. 

The gestalt interpretation of forgetting, like that of learning, is 
concerned with perceptual changes. The memory trace tends to 
change spontaneously with time into a “better gestalt.” The con- 
cept of a good gestalt is a rather difficult one to explain. It is the 
pattern of organization which a system tends to adopt, whether 
the system is a soap bubble or a perception. Thus, a soap bubble 
tends to adopt the form of a sphere; if forced into a different form, 
it will tend to become a sphere as soon as the force is removed. 
Similarly, perceptions tend to adopt certain forms as closely as 
the conditions of stimulation permit. Good gestalten tend to be 
simple and regular. In the case of physical gestalten, it is often pos- 
sible to describe the characteristics of a good gestalt quite pre- 
cisely with mathematical equations (for example, the formula for 
a sphere as applied to the soap bubble). As we move into the realm 
of perception, previous experience begins to play a part in deter- 
mining what is a good gestalt. Familiar, meaningful forms tend to 
be better gestalten than unfamiliar, meaningless ones. Innate fac- 
tors are still of major importance, however, in determining what 
constitutes a good perceptual gestalt. When we consider the topic 
of learning, we find experience still more important, but innate 
perceptual criteria still apply. 

It is not surprising, in view of this interpretation, that the most 
famous study of forgetting by a gestalt psychologist (Wulf, 1922) 
was concerned with the forgetting of visual figures—simple line 
drawings. The subjects in this experiment were asked to look at 
the drawings and try to remember them, and then at various later 
times were asked to draw them from memory. Many differences 
appeared between the original drawings and the reproductions. In 
some cases the reproduction was simpler and more regular than the 
original; in other cases some salient detail of the original was 
accentuated in the reproduction; in still other cases the reproduction was 
more like some familiar object than the original had been. (None of 
the original drawings was clearly a picture of anything, but subjects 
saw many resemblances to familiar objects or patterns.) In all of 
these different kinds of changes, however, the experimenter saw a 
trend toward a clearer or more consistent figure, at least as it ap- 
peared to the particular person—in other words, a trend toward 
a better gestalt. In trying to remember the original figure 
accurately, the subjects actually succeeded in remembering an 
“improved” version of it. Forgetting was thus not simply a loss of 
detail, but rather a distortion of what was physically present in the 
original drawing into something else that constituted a better 
gestalt. 


Insight in Education 


Wertheimer’s most noted contribution to the development of 
Gestalt psychology, once the enterprise that he started was under 
way, was in applications to education. He was concerned with in- 
sightful learning in school children. Whereas Köhler studied insight 
in apes for theoretical reasons, Wertheimer also had a very practi- 
cal interest in the topic in school children. It seemed to him that 
teachers put far too much emphasis on rote memorizing at the 
expense of understanding. He therefore directed his studies to- 
ward finding ways in which learning could take place with greater 
insight on the part of the learner. 

In his book Productive Thinking, Wertheimer makes a distinc- 
tion between two types of attempted solutions to problems. Solu- 
tions of type A are those in which there is originality and insight; 
solutions of type B are those in which old rules are inappropriately 
applied, and hence are not really solutions at all. This distinction 
does not imply that B solutions depend on previous experience 
and A solutions do not. Both depend on previous experience; the 
difference lies in the original organization that characterizes A 
solutions. 

Wertheimer found geometry an especially useful area in which 
to study different approaches to problems. One of his problems, 
which he presented to adults as well as children, required the sub- 
ject to find the area of a parallelogram. Wertheimer would begin 
by showing the subject how to find the area of a rectangle, not sim- 
ply the formula of length times height, but the reason why the for- 
mula works. He did so by dividing the rectangle up into small 
squares (Fig. 2, Part A) and showing that the area was the num- 
ber of squares in a row times the number of rows. He then 
presented the subject with a parallelogram cut out of paper (Fig. 
2, Part B) and the instruction to find its area. Some people re- 
plied that this was a new problem and they couldn’t be expected 
to solve it without being told how. Some blindly repeated the now 
incorrect formula of multiplying one side times the other: a B 
type of solution. Other people attempted to find an original solu- 
tion, but were unable to see the essential relationships. A few, 
however, came up with genuine A solutions. One child, noting 
that the two projecting ends were what made the problem difficult, 
asked for scissors, cut off one end, and fitted it against the other 
end, thus converting the parallelogram into a rectangle (Fig. 2, 
Part C). Another subject achieved the same goal by bending the 
parallelogram into a ring, so that the two ends fitted together, and 
then cutting the ring vertically to convert it into a rectangle. These 
two individuals showed a genuine understanding of the situation 
that made possible correct, original solutions. 
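
The geometry behind the scissors solution can be checked numerically. 
The sketch below is an illustration only; the dimensions (base 8, 
height 4, horizontal shift 3) and the helper polygon_area are 
assumptions introduced here, not figures from Wertheimer. It computes 
areas from corner coordinates and compares the parallelogram with the 
rectangle obtained by moving one triangular end to the other side. 

```python
def polygon_area(vertices):
    """Area of a simple polygon from its (x, y) corners (shoelace formula)."""
    total = 0
    n = len(vertices)
    for i in range(n):
        x1, y1 = vertices[i]
        x2, y2 = vertices[(i + 1) % n]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2


# Hypothetical dimensions chosen for illustration.
base, height, shift = 8, 4, 3

# The parallelogram as presented to the subject.
parallelogram = [(0, 0), (base, 0), (base + shift, height), (shift, height)]

# The same paper after the left triangular end is cut off and fitted
# against the right end, giving a rectangle of the same base and height.
rectangle = [(shift, 0), (base + shift, 0),
             (base + shift, height), (shift, height)]

slanted_side = (shift ** 2 + height ** 2) ** 0.5   # length of the sloping side

print(polygon_area(parallelogram))   # 32.0
print(polygon_area(rectangle))       # 32.0
print(base * height)                 # 32, agrees with both areas
print(base * slanted_side)           # 40.0, the blind "one side times the other"
```

Under these assumed dimensions the rearranged rectangle has exactly the 
area of the parallelogram, while multiplying one side by the other 
overstates it. 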

If these individuals had applied the rule “base times height,” 
which for rectangles is equivalent to the rule “one side times 
the other,” the computation would have been correct, but never- 
theless would have demonstrated no understanding. Thus it 
would have been much like a B solution, even though it would 
have happened to be correct. What they did, however, was 
to find an original way of converting this new problem into a fa- 
miliar one, one that they knew how to solve. The final solution de- 
pended very much on previous experience, but it was previous 
experience organized in a novel way. The important thing about 
the solutions was the insight by which the new problem situation 
was restructured. From the solvers’ point of view, they converted 
the parallelogram into a better gestalt, a rectangle. 
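
The contrast between the blind rule and the restructured problem can be checked numerically. The following Python sketch is an illustration added here, not part of Wertheimer's procedure; the dimensions (base 16, height 4, a slant of 3) and the helper name `shoelace_area` are assumptions chosen for convenience. It computes the area of a parallelogram directly from its corners, shows that moving one projecting end to the other side leaves a rectangle of the same area, and shows that "one side times the other" gives a different figure.

```python
import math

def shoelace_area(vertices):
    """Area of a simple polygon whose corners are listed in order."""
    total = 0
    for i, (x1, y1) in enumerate(vertices):
        x2, y2 = vertices[(i + 1) % len(vertices)]
        total += x1 * y2 - x2 * y1
    return abs(total) / 2

# Hypothetical dimensions, not taken from Wertheimer's figure.
base, height, shift = 16, 4, 3

# Parallelogram: the top edge is displaced `shift` units to the right.
parallelogram = [(0, 0), (base, 0), (base + shift, height), (shift, height)]

# The child's solution: cut the projecting triangle off the left end
# and fit it against the right end, which leaves a rectangle.
rectangle = [(shift, 0), (base + shift, 0), (base + shift, height), (shift, height)]

area_parallelogram = shoelace_area(parallelogram)
area_rectangle = shoelace_area(rectangle)

# The "B" answer: multiply one side by the other, ignoring the slant.
slanted_side = math.hypot(shift, height)
blind_answer = base * slanted_side

print(area_parallelogram, area_rectangle, base * height)  # the three agree
print(blind_answer)  # larger than the true area
```

The code can confirm that the rearrangement preserves area, but, in Wertheimer's terms, running it involves no more understanding than applying the formula does; the insight lies in seeing why the cut works.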

Fig. 2. Wertheimer’s parallelogram problem. Part A shows Wertheimer’s
way of explaining why the area of a rectangle equals the product
of length times width, in this case sixteen by five. Part B shows the
parallelogram for which he asked subjects to find the area. Part C shows one
person’s solution to this problem by cutting off one end, moving it to the
other end, and thus converting the parallelogram into a rectangle.

Even when a solution is correct, it is important to distinguish
whether or not real understanding is involved. Understanding
is not the same as logic. Both the inductive and the deductive meth- 
ods of logic may be applied blindly. The inductive method, in 
which one reasons from particular instances to a general conclu- 
sion, is really just trial and error. Another person might have tried 
various possible formulas for the area of a parallelogram, found 
that base times altitude gave the same answer as the book in sev- 
eral cases, and concluded that this was the correct formula 
but without having any idea why. Though adequate for practical 
purposes, this would be a solution without understanding. Wert- 
heimer enjoys giving examples of cases where such blind induc- 
tion leads to absurdly wrong conclusions. The deductive method, 
in which one reasons logically from one principle to another, can 
also be applied blindly. A student may fumble around algebrai- 
cally until he finds a valid proof that a certain equation is correct, 
but he may still not understand the equation in the sense that Wert- 
heimer means. Understanding implies not merely logical correct- 
ness but a perception of the problem as an integrated whole, of the 
ways in which the means lead to the end. In going through an alge- 
braic proof, for example, one should ask at each step not only 
“How does this follow logically from the previous step?” but also 
“How does this lead toward the solution I am looking for?” In 
Wertheimer’s opinion, education should make such understand- 
ing, or perception of whole gestalten, its primary goal. 

The value of creative problem solving is not restricted to such 
“purely intellectual” situations as the above. Wertheimer illus- 
trates its value in social situations with an anecdote about two 
boys playing badminton. The older boy was so much better at the 
game than the younger that he won nearly every point, and the 
frustrated younger boy finally refused to play any more. Since this 
spoiled the fun for the older boy, it posed a problem for him. How 
could he get the younger boy to go on playing with him? He might 
have exhorted him to be a sport (probably in vain), or he might 
have offered a handicap (a better approach, but still not an an- 
swer to the fundamental problem posed by the one-sided com- 
petition). Seeing the competition as the crux of the problem, the 
older boy was able to propose a constructive solution. The com- 
petitive game of winning points was replaced by a cooperative 
game of seeing how long the two together could keep the bird go- 
ing back and forth, and both were then able to enjoy play- 
ing. Again, understanding of the situation led to an insightful so- 
lution. 

This book of Wertheimer’s, set beside Guthrie’s Psychology of 
Learning, points up the contrast between cognitive and connection- 
ist views of learning in their extreme forms. Both books show a 
keen interest in the applied psychology of learning, especially as 
applied to children. Guthrie’s emphasis is on training the child to 
make the right responses to the right stimuli. His question is al- 
ways “What does the child do?” Wertheimer, on the other hand, 
is concerned with educating the child to have insights into the material.
His question is “What does the child understand?” The
difference is not irreconcilable, since Wertheimer is concerned 
with the ability to solve problems effectively and since Guthrie 
can talk about understanding in terms of movement-produced 
stimuli. Nevertheless, the difference in emphasis is tremendous. 
This emphasis on understanding, on the perception of relation- 
ships within an organized whole, is the great contribution of gestalt 
psychology to the interpretation of learning. 


LEWIN’S SYSTEM 


Among the gestalt psychologists who worked with Werthei- 
mer, Köhler, and Koffka in Berlin was Kurt Lewin (1890-1947). 
Like the other leading gestalt psychologists, he eventually settled
in the United States. His interests were different from theirs in a 
number of respects. Whereas they were mainly concerned with 
rather technical problems in perception, learning, and thinking, he 
was interested in motivation, personality, and social psychology. 
Gestalt psychology, in dealing with learning, tended to take the 
desire for certain goals for granted and to concentrate on the way 
the goals are obtained through cognitive restructuring. Lewin 
wanted to concentrate on the desires and the goals themselves, 
studying them in relation to the personality. The system he de- 
veloped for conducting this study is not primarily a theory 
of learning, but it is a system of description within which learning, 
motivation, personality, and social behavior can all be discussed. 


The Life Space


What Lewin wanted was a theoretical system for predicting the 
motivated behavior of a single individual. He found the answer 
in the concept of life space. This may be defined as the totality of 
facts which determine the behavior of a given individual at a 
given time. It is represented conceptually as a two-dimensional 
space in which the individual moves. This space contains the per- 
son himself, the goals he is seeking, the negative “goals” he is try- 
ing to avoid, the barriers that restrict his movements, and the 
paths he must follow to get what he wants. 

This concept of life space is more complicated than is at first 
apparent. For one thing, it must not be confused with physical or 
geographical space. This is not the world of physical objects and 
real other people, but the world as it affects the individual. Hence
an object of which he is unaware and which does not influence him 
would not appear in his life space, even though physically it may 
be close to him. Similarly, something he thinks is there and reacts 
to as if it were there is present in his life space even though physi- 
cally absent. If a child thinks there is a tiger under his bed, the tiger 
is part of the child’s life space, even if everyone else insists that the 
tiger is purely imaginary. 

Can we say, then, that the life space is the person’s environment 
as he himself perceives it? This is a hard question to answer, since 
it depends on just what we mean by “perceive.” We cannot say 
that the life space is made up only of the things of which the per- 
son is consciously aware. An individual may be influenced by fac- 
tors of which he is unconscious. For example, a certain high-school 
teacher would like to take an administrative position, and feels 
quite able to do so. Nevertheless, whenever an opportunity to apply
for such a position comes along, he finds some excuse not to
apply. After this has happened several times, his friends suspect 
that something is holding him back from seeking an administrative 
position, perhaps some deep-seated lack of confidence in his own 
ability. In Lewin’s terms, there is a barrier in his life space between 
him and the goal of an administrative job. Yet he insists that he 
wants such a job and is going to apply as soon as just the right opportunity
comes along. If we go by what he says, we would conclude
that he does not perceive any important barrier between
himself and the goal of an administrative job. If we go by what 
he does, we would conclude that he does perceive such a barrier, 
since he acts as if one is there. Which does he really perceive? 
This is a question we cannot answer, but fortunately we do not 
need to. If he acts as if the barrier is there, then it is there in his life 
space. It is from the way a person behaves that we know what is 
present in his life space. It is often convenient to speak of the life
space as “the environment as the person views it,” but we must 
keep in mind that what we really mean is “the environment as it 
affects his behavior.” 

The life space includes the person himself and his behavioral 
environment, which consists of everything that influences his be- 
havior. Of particular importance in the life space are the goals 
the person is seeking, the things or situations he is trying to avoid, 
and the barriers that restrict his movement toward or away from 
them. Lewin represents the life space by two-dimensional dia- 
grams (see Fig. 3). Any place, object, or situation that the individ- 
ual wants to approach (strictly speaking, acts as if he wants to ap- 
proach) is said to have positive valence, and is represented in the 
diagram by a plus sign. Any that the individual wants to avoid is 
said to have negative valence and is represented in the diagram 
by a minus sign. Barriers are represented by heavy lines sepa- 
rating one part of the life space from another. 

These diagrams are of course highly schematic. They usually 
do not correspond to physical space. For example, a politician’s 
life space might include the governorship of his state as a positive 
valence and the election as a barrier. The distances in the diagram 
between the politician, the election, and the governorship have 
no particular meaning in the life space that the diagram represents. 
What is important in the life space is the regions through which 
the politician must pass to reach his goal and the difficulties of 
getting from one region to another (i.e., the barriers). These considerations
led Lewin to decide that the ordinary geometry of distances
was not appropriate for picturing the life space. He therefore
turned to the special kind of geometry known as topology.
Topology is sometimes called “rubber-sheet geometry,” since topological
space can be stretched in any direction without making
any difference. Topology is concerned only with the boundaries
between regions, not with the sizes or shapes of the regions or the
distances from one place to another. When an area is divided up
topologically, all that matters is what regions separate one point
from another. Such topological space was much more convenient
for Lewin’s purposes than the space of ordinary geometry.

Fig. 3. A topological diagram of a politician’s life space. At present the
individual is practicing law. He sees three other regions available to him.
From the mayoralty or from Congress he could become a candidate for
governor, but the boundary between the judgeship and candidacy appears
to be impenetrable. The boundary between the region of candidacy
and the positively-valenced region of the governorship also forms a barrier,
indicating serious doubts as to whether he could win this election.
Note that the sizes and shapes of the regions could be greatly changed
without changing the meaning of the diagram, so long as the boundary
relationships remained the same.

Whereas it is meaningless to ask the distance, in inches or miles, 
between a politician and his goal of being governor, it is meaning- 
ful to ask what regions he must pass through to get to the goal. 
Certainly the final region before the region of being governor is 
that of being a candidate. Before that there may be alternative 
paths—perhaps one by way of the mayoralty of a large city and 
another by way of Congress. These regions are not, of course, par- 
ticular places in physical space, but they are meaningful parts of 
the life space. They are positions that the individual can occupy. 
Positive and negative valences are attributes of certain regions, 
as with the positive valence of the governorship. Barriers are
boundaries between regions which are especially hard to pene- 
trate. The barrier formed by the gubernatorial election, for exam- 
ple, is the boundary between the region of candidacy and the re- 
gion of governorship. If a barrier is impenetrable for the person 
in question, he cannot enter the next region, and any path going 
to or through that region is blocked. 
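
Because only the boundary relations matter, a life space of this kind can be sketched as a graph whose nodes are regions and whose edges are the boundaries between them. The Python sketch below is one possible rendering, not Lewin's own notation. The region names follow the politician example; the three boundary labels, the assumption that the boundaries around law practice are freely passable, and the function name `open_paths` are illustrative choices made here.

```python
from collections import defaultdict

# Boundary labels are illustrative, not Lewin's terminology.
PERMEABLE, BARRIER, IMPENETRABLE = "permeable", "barrier", "impenetrable"

# Each entry names two adjoining regions and the kind of boundary between them.
boundaries = {
    ("law practice", "mayoralty"): PERMEABLE,
    ("law practice", "Congress"): PERMEABLE,
    ("law practice", "judgeship"): PERMEABLE,
    ("mayoralty", "candidacy"): PERMEABLE,
    ("Congress", "candidacy"): PERMEABLE,
    ("judgeship", "candidacy"): IMPENETRABLE,
    ("candidacy", "governorship"): BARRIER,  # the election: hard, not closed
}

def open_paths(boundaries, start, goal):
    """List every route from start to goal that crosses no impenetrable boundary."""
    neighbors = defaultdict(list)
    for (a, b), kind in boundaries.items():
        if kind != IMPENETRABLE:
            neighbors[a].append(b)
            neighbors[b].append(a)
    paths, stack = [], [[start]]
    while stack:
        path = stack.pop()
        if path[-1] == goal:
            paths.append(path)
            continue
        for region in neighbors[path[-1]]:
            if region not in path:  # never re-enter a region already passed
                stack.append(path + [region])
    return sorted(paths)

paths = open_paths(boundaries, "law practice", "governorship")
for path in paths:
    print(" -> ".join(path))
```

For the boundaries given here the function finds two routes, one through Congress and one through the mayoralty; the judgeship appears in neither because its boundary with candidacy is impenetrable. The barrier before the governorship remains on both routes: the sketch records that it is hard to cross but says nothing about whether it will be crossed. Note also that no distances or sizes appear anywhere in the data, which is the sense in which the description is topological.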

These topological diagrams can be used to represent all the 
significant elements in any life situation. Positive valences may 
be attached to regions as specific as “eating candy” or as vague 
as “maintaining social status.” Negative valences may apply to 
situations as diverse as spanking for a child and guilt feelings for 
a criminal. Barriers may be physical (e.g., a locked door), intel- 
lectual (e.g., a difficult examination), or social (e.g., the exclu- 
siveness of a positively valenced club). The regions through which 
a person must pass may be places, social statuses (e.g., child, ad- 
olescent, adult), activities (e.g., practicing tennis before becoming 
a star), or any other situations that necessarily occur to the indi- 
vidual as sequences. The diagrams of the life space can apply 
equally well to a child prevented from getting a cookie (positive va- 
lence) by a high cupboard (barrier) or to a student who must pass 
through the regions of college, medical school, and internship in 
order to become a doctor. In all of these cases we must remember 
that it is the situation as it influences the person, not the “objec- 
tive” situation, with which we are concerned. If the politician 
were so sure he would lose the election that he declined to become 
a candidate, then the election would be an impenetrable barrier 
in his life space, even though his pessimism might actually be 
quite unjustified. (For a politician, such unjustified pessimism 
probably makes the example quite unrealistic.) In any case, it is 
subjective rather than objective reality that makes up the life space. 


Predicting Behavior 


The topological regions in a person’s life space and the barriers 
between them indicate what paths it is (subjectively) possible for 
him to follow. The positive and negative valences of various re- 
gions suggest which of the possible paths he is actually likely to 
follow. However, Lewin also wanted to be able to indicate the 
relative strengths of tendencies to approach or to avoid different
points in the life space. There are often a number of positive va- 
lences in the life space at a given time—which one will be 
approached? To answer this question, Lewin added vectors to his 
system. A vector is a force operating in a certain direction. It is 
represented by an arrow, with the direction of the arrow indicat- 
ing the direction of the force and the length of the arrow indicat- 
ing the strength of the force. Vector analysis has been most prom- 
inent in physics, where forces of different magnitudes operating 
in different directions play an important part. Lewin adopted 
only the idea of vectors, not the mathematical analysis that usu- 
ally goes with them. In his diagrams, he indicated the tendency 
to approach a positive valence by a vector pointing from the person
toward the valence, and the tendency to avoid a negative valence
by a vector pointing from the person away from the valence.
The length of the arrow indicates the strength of the tendency. This 
makes it possible to show which of the forces acting on an indi- 
vidual is the strongest. 
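
As an illustration of the bookkeeping involved, the Python sketch below records each vector as a region, a sign, and a strength, and then picks out the strongest force. The class name, the numerical strengths, and the particular forces (borrowed loosely from the candy, spanking, and social-status examples mentioned earlier) are assumptions made here; Lewin himself drew arrows in diagrams rather than assigning numbers.

```python
from typing import NamedTuple

class Vector(NamedTuple):
    region: str      # region of the life space that carries the valence
    valence: int     # +1 for a positive valence, -1 for a negative one
    strength: float  # length of the arrow, in arbitrary units

    def describe(self):
        direction = "toward" if self.valence > 0 else "away from"
        return f"{direction} {self.region} (strength {self.strength})"

# Hypothetical forces acting on a child at one moment.
forces = [
    Vector("eating candy", +1, 3.0),
    Vector("being spanked", -1, 2.0),
    Vector("maintaining social status", +1, 1.0),
]

# Comparing arrow lengths shows which force is strongest.
strongest = max(forces, key=lambda v: v.strength)

for force in forces:
    print(force.describe())
print("strongest:", strongest.describe())
```

Identifying the strongest force in this way is only a comparison of arrow lengths. It is not by itself a prediction of what the person will do.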

Although the description of the life space provides a consider- 
able basis for predicting an individual's behavior, a number of 
questions remain unanswered. When two or more paths toward a 
goal are available, which will be followed? When two or more 
vectors operate on an individual, will only the stronger be effec- 
tive or will some compromise occur? If a barrier proves insur- 
mountable, will some other goal be substituted for the inaccessi- 
ble one? Lewin’s reply is that topological psychology determines 
what behaviors are possible and what ones impossible. It does not 
tell us which one definitely will occur. However, the more we 
know about the details of a person’s life space, the more we can 
narrow down the range of possibilities that are open to him. Thus 
more and more information will make our predictions better and
better, until perhaps eventually they will become almost perfect. 

Let us consider, as an example, a student who has a strong vec- 
tor toward the region of being a teacher, and also a moderately 
strong vector toward the region of being well-to-do. We may sus- 
pect that this situation will lead to problems. However, if we know 
nothing about the topology of his life space, we cannot predict his 
behavior. If we know that in his life space these two positively valenced
regions overlap (i.e., he perceives teachers as well-to-do),
then there is no problem (regardless of what the objective situa- 
tion may be). He will simply approach this goal of being a well- 
to-do teacher. If, instead, we know that in his life space the two re- 
gions are separate (i.e., teachers are not well-to-do), we are again 
unable to predict. He may simply react to the stronger vector, or 
he may find some compromise, or he may investigate the matter 
further and perhaps as a result restructure his life space (as by find- 
ing out that some teachers actually are well paid, which would 
make the two regions move together and overlap). However, if 
we can find out more about his life space, our chance of predicting 
successfully which of these things he will do will increase. If the 
region of being well-to-do includes a subregion similar to teach- 
ing (such as industrial training or school administration), this fact 
makes him more likely to approach that subregion as a compro- 
mise. If he has a vector toward further investigation of the pos- 
sibilities, this increases the chance that he will get new information 
leading to a restructuring of the life space. As our knowledge of 
his life space increases, our ability to predict his behavior also in- 
creases. 

As we include more of these complexities, the task of diagram- 
ing the life space becomes more difficult. Lewin tried hard to 
make his system complete enough to handle the great va- 
riety of human situations. He discussed the problems of regions, 
subregions, and boundaries. He searched for the best way to rep- 
resent looking at something as distinguished from going to it. He 
extended his diagrams into additional dimensions in order to deal 
with plans for the future and with fantasies. As a result, his com- 
plete topological and vector analysis is rather complicated. Most 
of these technical details are treated in his book, Principles of Top- 
ological Psychology. In addition, much of the research which he 
and his (often devoted) students carried on was concerned with 
clarifying these problems. What effect do barriers have on va- 
lences? What determines whether a substitute goal will be accept- 
able to the person? These are the sorts of questions to which Lewin’s 
research made important contributions.

In addition to his work on the life space, Lewin also developed 
theoretical interpretations of personality structure. These interpretations
led to many experiments and applications, some of which
are discussed in his book, Dynamic Theory of Personality. The 
only one of these that needs to be mentioned here is his concept 
of tension. Vectors result from tensions within the person. These 
tensions may be very similar to what Miller (and many others) 
refer to as drives. They may also, however, be of a less biological 
and more cognitive sort, such as the tension to complete a task 
once it is started. 

Lewin’s theoretical analyses of the person and the life space, 
along with the many experimental studies that he based on these 
analyses, made him famous as a psychologist of personality and 
of social behavior. He was always interested in the application 
of psychology to these areas. The organization of groups, the 
causes and cures of prejudice, and the effects of frustration are 
among the topics to which he made important contributions. 


Lewin’s Weakness in Predicting Learning 

From the point of view of the psychology of learning, however, 
all of the above interests are pretty much beside the point. Lewin 
provided us with a system for describing and predicting behav- 
ior, but he did not give us a theory of learning. Much of the value 
of studying his system in this book lies in understanding why he 
did not. A knowledge of the life space as described by Lewin enables
us to predict reasonably well what the individual will do.
But how do we determine the structure of the life space? By observing
the individual’s behavior and inferring from it what the
structure must be. Having observed someone’s behavior, we figure
out the structure of his life space. From this structure we predict
what his behavior in the future will be. In order for this
method to work, his life space must remain the same except for his
own position in it. Learning, however, involves a change in the life
space. How can we predict when and how such changes will oc- 
cur? In some cases a knowledge of the life space (and the person in 
it) is enough, as for example when we predict that long exposure 
to a region of positive valence will produce satiation and make the 
valence less strongly positive. In most cases, however, we must 
take into account external, physical reality. If an individual 
comes to a door that he thought was unlocked, tries to open it, and 
finds it locked, the door changes in his life space from a perme- 
able boundary to a barrier. How could we have predicted this 
change in the life space? Only by knowing that the door was, in 
fact, locked. But such questions of physical reality do not really 
enter into Lewin’s system. 

Lewin was, of course, aware of this problem, and he discussed 
the ways in which external events can produce changes in the life 
space. However, this part of his theory is very minor, and he never 
developed it in detail. In his research he showed a lively inter- 
est in finding out how best to change attitudes, but little of this prac- 
tical interest found its way into his formal theorizing. In his the- 
oretical work, for the most part, he was too busy examining the 
structure of the life space, the organization of the person, and the 
way these influence behavior to be concerned with the ways in 
which they are themselves changed by external events. As a re- 
sult, his theoretical system is useful for stating the results of cer- 
tain kinds of learning, but of little value in predicting what learn- 
ing will occur under what conditions. 

This lack of concern with learning as such reflects Lewin’s phil- 
osophical view of science. He sees most psychologists as looking 
for statistical laws that apply more or less to people (or animals) 
in general, but not specifically to any given individual. He, however, 
wants to achieve such individual prediction. One way of doing 
so would be to find general laws for predicting how individuals will 
develop throughout their life histories. If we knew enough about 
the laws of heredity, of biological maturation, and of learning, and 
if we knew enough about an individual’s ancestry and his lifetime 
of previous experiences, we could make quite accurate predic- 
tions of his present behavior. This is the approach that Guthrie or 
Skinner or Miller would have to take if he wanted to predict indi- 
vidual differences before doing any training. (Such prediction has 
not, however, been the goal of their work.) Lewin rejects this ap- 
proach. So far as he is concerned, though it might be possible in 
principle it would be hopelessly cumbersome in practice. It would 
be especially cumbersome because the effect of each learning 
experience would depend on the way it was perceived, which in 
turn would depend on previous learning. Hence Lewin rejects at- 
tempts to explain present behavior by past circumstances, which 
would of course include previous learning. Behavior at the present
time depends on the life space at the present time, and explanations
in terms of previous learning are inadequate and misleading.
With this bias, it is understandable that Lewin did not want
to concentrate on problems of learning. However, in not doing so
he left a serious gap in his theory. Without more information on
how the life space can change, we are hard put to make long-term
predictions and even more hard put to make recommendations
about teaching, psychotherapy, or other problems in learning. 


COGNITIVE AND CONNECTIONIST 
THEORY COMPARED 


At this point we may ask: why do some psychologists use the 
stimulus-response approach and others the cognitive approach?
We saw some of the reasons for these different preferences in
Chapter 1. Now, having considered some of the advantages and
some of the disadvantages of both approaches, we may examine
the question further. Let us consider two kinds of psychologist,
each working toward certain objectives, and see why each chooses
the theory that he does.

Let us look first at a psychologist working directly with indi- 
viduals whose behavior he wants to predict. He might be, for ex- 
ample, a school counselor working with mildly disturbed chil- 
dren. He is concerned, not with children in general, but with each 
particular child that he sees. His job is to help this child by whatever
means he can, regardless of whether he can explain the principles
by which he does it. Usually the most effective thing
for such a counselor to do is to find out as much as possible about 
this particular child and then use this information to reconstruct 
the child’s life space. This method would be Lewin’s. In using 
this method, he is not systematically applying laws, but is taking 
advantage of his own cultural background and playing whatever 
hunches he has. Cognitive theory is a more sophisticated description
of what he regularly does.

Now let us look at another psychologist, this one doing research
on learning. He wants to find laws of behavior, laws that
have the same kind of generality as those in the natural sciences. 
He wants to explain man’s behavior as part of the natural world, 
as the product of man’s heredity and environment. To reconstruct
the life space from behavior and then use it to predict new behav- 
ior is, for him, to miss the main point. He is not concerned with 
predicting one person’s behavior by any means available; he 
wants to predict the behavior of people in general by means of 
precise laws. These laws should be applicable, not only to people 
whom the psychologist knows, but also to strangers whose life 
spaces he would have little basis for guessing. 

Both of these psychologists recognize that each individual is 
unique. They differ in the place they give to this uniqueness in 
their systems. To the first, each person’s unique nature is the 
starting point from which predictions are made and on which 
studies are based. To the second, a given person’s uniqueness is 
the end result of the operation of many laws, something to be ex- 
plained rather than something to be used as an explanation. 

These considerations apply to all aspects of psychology, not 
just to learning. Are there any reasons why one or the other of 
these approaches is especially useful in the area of learning? On 
the one hand, we have already seen that the method of reasoning 
from behavior to the life space and then back to behavior is not 
by itself adequate for studying learning. This approach must be 
supplemented by some consideration of the relation between ex- 
ternal events and the life space. Because Lewin largely ignored 
this topic, his approach in this respect is less adequate than those 
of the connectionists. 

On the other hand, it is scarcely realistic to say that learning in- 
volves nothing but increases and decreases in the tendencies to 
make certain responses. The particular responses we make can- 
not generally be understood without reference to such concepts 
as beliefs and purposes. When our behavior changes as a result 
of something we are told, it seems more reasonable to attribute 
this behavior change to a change in our cognitions than to a 
change in our habits. As a result, the connectionist views we have 
looked at so far seem too narrowly mechanistic to deal with the 
complexities of human learning. 


TOLMAN’S PURPOSIVE BEHAVIORISM 


A number of attempts have been made to combine the advantages
of cognitive and connectionist theory. Most of these have
been extensions of connectionist theory to include some of the
complexities with which cognitive theories commonly deal. One, 
however, was an attempt to give cognitive theory as close a con- 
nection with external stimuli and with learning as connectionist 
theory has. This was the system developed by Edward Chace Tol- 
man (1886-1959), who for forty years was on the faculty of the 
University of California at Berkeley. Though Tolman’s theory 
might well be regarded as an attempt to solve the difficulty 
of other theories we have discussed, it was actually developed 
earlier than many of them. 

Tolman’s major work, Purposive Behavior in Animals and 
Men, was published in 1932. Though his system has undergone 
a number of modifications since then, its essential spirit remains 
the same. Writing in the heyday of behaviorism, Tolman was impressed
with behaviorism’s objectivity, its concern with the precise
measurement of behavior, and its faith in the improvability
of man. At the same time he felt that behaviorism showed too lit- 
tle appreciation of the cognitive aspects of behavior. We do not 
simply respond to stimuli, he argued; we act on beliefs, express at- 
titudes, and strive toward goals. What we need, therefore, is a 
theory that recognizes these aspects of behavior without sacrificing 
objectivity. To fill this need, Tolman undertook to create what has 
been called a purposive behaviorism.

What is a purposive behaviorism like? Two of its characteris- 
tics are implied in the name. First, it is a form of behaviorism. 
As such, it is concerned with objective behavior, not with conscious
experience. Moreover, it is concerned with the effect of external
stimuli on behavior, not merely with a life space inferred
from behavior. Finally, it is concerned with learning, with the way
that behavior changes with changing experience of the external 
world. 

However, this is a purposive behaviorism. Whereas Watson 
treated behavior as a matter of responses to immediately present 
stimuli, Tolman emphasized the relation of behavior to goals.
Most of our behavior is not so much a response to stimuli 
as a striving toward some goal. Stimuli of course guide us toward 
the goal and determine at every step what means we will use to 
reach it, but the search for the goal is what gives unity and
meaning to our behavior. We may shift from one approach to another
as circumstances require, while still continuing to direct our
efforts toward the same goal. As a result, it would be necessary for
anyone who wanted to predict our behavior to know the goal we 
were seeking as well as the particular stimuli we were encountering
along the way. Tolman’s system is called a purposive behaviorism
because it studies behavior as it is organized around purposes.

The behavior that Tolman wanted to study is molar behavior. 
This term refers, not to the kind of behavior, but to the way in 
which it is analyzed. Molar behavior is behavior analyzed in 
fairly large, common-sense units, such as driving to work or cook- 
ing a meal. In practice, this is the way all theories of learning ana- 
lyze behavior. However, some theories include an interest in mo- 
lecular behavior, which is behavior analyzed in terms of single 
movements of particular muscles. Walking a city block, for ex- 
ample, is a molar act made up of an enormous number of molec- 
ular movements—expansions and contractions of the various 
muscles of the legs and other parts of the body. Guthrie is an
example of a theorist who puts a good deal of emphasis on molecular
analysis. Tolman, on the other hand, states explicitly that
he is concerned only with molar behavior. The ways in which mo- 
lecular movements work together to produce molar acts are of no 
concern to his system. 


Cognitions as Intervening Variables 


A given goal may be approached by a great variety of different
acts, not only as sequences of responses but also as alternative
possible ways of gaining the objective. Tolman’s problem
was to develop a theory for dealing with this complex variability
of molar behavior as it operates in search of goals. To do this, he 
considered it necessary to take account of the individual’s cog- 
nitions, his perceptions of and beliefs about the world. These cor- 
respond to Lewin’s life space. How could Tolman take them into 
account without sacrificing the objectivity of behaviorism? His 
answer was to make use of intervening variables. To appreciate 
the importance of this answer we must see it in historical context.
The existing behaviorist theory regarded anything intervening
between the stimulus and the response as itself a response,
just as physical and potentially just as measurable as any other
response. If the word “cognition” had any meaning at all for such
behaviorists, it was either another name for tiny movements of
the speech muscles or else a figment of some sloppy mentalistic
theorist’s imagination. Tolman objected to both these meanings
of “cognition.” He thought it should be possible to use the term
“cognition” objectively without treating it as a physical, directly
measurable movement. He therefore designated cognitions as
intervening variables. In doing so he both made the concept of
cognition more respectable in behaviorist circles and introduced the
concept of intervening variables into psychology.

As an intervening variable, a cognition is not a thing. It is an 
abstraction defined by the theorist. While it is possible that
physiologists may some day find some particular activity in the brain
that corresponds to a cognition, this possibility is no concern of
Tolman’s. The meaning of the word “cognition” is determined 
by the definition that the theorist gives it. For Tolman, this defi- 
nition is in terms both of stimuli and of responses, since it inter- 
venes between them. Experience with certain stimuli results in 
the formation of certain cognitions.

In addition, certain needs produce demands for certain goal 
objects. (Deprivation of food, for example, produces a demand
for food.) These demands are also intervening variables. Cogni- 
tions and demands work together to produce responses. 

The difference between this system and the other cognitive
theories we have discussed so far is mainly one of emphasis. For
Lewin, the relation of the life space to external stimuli is to some 
extent a side issue. For Tolman, it is of central importance. He is

Both attempt to predict behavior from stimuli and other antecedent
conditions. Does the fact that Tolman uses cognitions as
intervening variables make any real difference? Consider an
individual who makes a certain response to certain stimuli and
obtains a reward. What learning does this experience tend to
produce? For Guthrie, Skinner, or Miller, it produces a tendency for
those stimuli to be followed by that response, provided other con- 
ditions (such as drive, in Miller’s case) are appropriate. For Tol- 
man, it produces a cognition that making the response will lead 
to the reward. If he now has a demand for the reward, he 
will make the response. How are these two interpretations really 
different? 

The answer to this question is that not all cognitions take the 
form, “If I do this, I will get that.” We form many other kinds of 
cognitions about the way the environment is structured, about 
things that go together, about what paths lead to what places. 
These various sorts of cognitions can then be used when needed 
to help the individual achieve his goals. Cognitions from several 
different learning experiences may be put together so that the in- 
dividual can respond adaptively to new situations. Attention to 
such combination makes it possible for Tolman’s theory to deal 
with more original and flexible behavior than is covered by the 
connectionist interpretations of learning that we have discussed 
so far. 

One example of this greater flexibility is in cognitions about 
what leads to what. Once an individual has learned how to get 
from one place to another, he can use this knowledge to obtain 
rewards quite different from the one he obtained while learning. 
A rat that has learned its way around a maze while satisfying its 
demand for exploration can use this knowledge later to obtain 
food. This changeover is illustrated in an experiment by Buxton. 
Rats were given several experiences of spending a night in a large 
maze. There was never any food in the maze, and the rats were 
taken out of the maze at different places on different occasions. 
This variation was designed to insure that no part of the maze or 
path through the maze would be rewarded more than any other. 
After this experience, the rats were fed in the goal box of the maze 
while they were 48 hours hungry and were then put in the start 
box of the maze. About half of the rats ran to the goal box without
a single error on this first trial, a record far above that of a
control group without the previous experience in the maze. Since the
rats had never been reinforced with food (or, presumably, with
anything else) for following this particular path, it is difficult
for stimulus-response theories to deal with this experiment. For
Tolman, however, it is simple enough; the rats had formed a
cognition (or perhaps a group of cognitions) about how the maze was
arranged. They did not make use of this knowledge until they
were given a reason to do so. In other words, the learning about
the layout of the maze remained latent until food was experienced
in the goal box. Hence this study is said to demonstrate latent
learning. Several different kinds of latent learning have been
studied; this experiment illustrates one of them. Whenever learning
goes on without its being evident in performance at the time,
latent learning is taking place.

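
The logic of this interpretation can be made concrete with a small
computational sketch. It is offered only as a loose modern analogy,
not as anything Tolman or Buxton proposed. The maze layout, the
names MAZE, explore, and shortest_path, and the use of a
breadth-first search are all assumptions introduced here for
illustration. Exploration records what leads to what without any
reward; the record is put to use only when a goal is supplied.

```python
from collections import deque

# Hypothetical maze: each place lists the places directly reachable from it.
MAZE = {
    "start": ["a", "b"],
    "a": ["start", "blind1"],
    "b": ["start", "c"],
    "c": ["b", "blind2", "goal"],
    "blind1": ["a"],
    "blind2": ["c"],
    "goal": ["c"],
}


def explore(maze, origin):
    """Visit every reachable place, with no reward anywhere, and
    record which places connect. The result plays the role of a
    cognitive map in this analogy."""
    cognitive_map = {}
    visited = set()
    stack = [origin]
    while stack:
        here = stack.pop()
        if here in visited:
            continue
        visited.add(here)
        for there in maze[here]:
            cognitive_map.setdefault(here, set()).add(there)
            cognitive_map.setdefault(there, set()).add(here)
            stack.append(there)
    return cognitive_map


def shortest_path(cognitive_map, origin, goal):
    """Breadth-first search over the learned map.
    Returns None when the map holds no known route to the goal."""
    frontier = deque([[origin]])
    seen = {origin}
    while frontier:
        path = frontier.popleft()
        if path[-1] == goal:
            return path
        for nxt in sorted(cognitive_map.get(path[-1], ())):
            if nxt not in seen:
                seen.add(nxt)
                frontier.append(path + [nxt])
    return None


# Experimental group: explored first, then given a reason to reach the goal.
experienced = explore(MAZE, "start")
print(shortest_path(experienced, "start", "goal"))

# Control group: no prior exploration, hence no map to draw on.
print(shortest_path({}, "start", "goal"))
```

In this sketch the explored map yields an errorless route on the
first attempt, while the empty map yields none. The learning is
present after exploration but shows up in performance only when a
goal is introduced, which is the sense of "latent" used above.
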
Another way in which Tolman’s theory achieved greater flexi- 
bility than connectionist theories was through its emphasis on 
learning the location of reward. Once an individual has learned 
where a given kind of reward is located, he can often get to that 
location by means other than those he originally used. If a shop- 
per finds an intriguing store while exploring the city on foot, he 
(or more likely she) can return later by car or bus. Similarly, a 
rat that has learned the location of food may, if the experimental
arrangements permit, take a detour to get there faster. This is il- 
lustrated by another experiment (Tolman, Ritchie, and Kalish, 
1946). Rats ran across a table top, through an enclosed alley, and 
then along a series of elevated pathways by a roundabout route 
to food (Fig. 4, Part A). After they had learned to use this route, 
the alley was blocked, but eighteen new pathways were made 
available, leading in various directions from the table top (Fig. 
4, right side). The rats did not, as one might expect from the prin- 
ciple of generalization, choose the doors closest to the one they 
had previously used. Instead, they tended to choose the one which 
pointed approximately toward the location of the goal. In other
words, they chose the one which appeared to be a short-cut to the
goal rather than those close to the original path. They had
not merely learned a route to food; they had also learned the
location of the food in space.

sary shows how strong a hold connectionist interpretations of
learning had on American psychology.

Tolman used a number of terms to refer to the cognitions that
are learned in various situations. One popular term is cognitive
map. It is easy to see how the two experiments discussed above 
can be interpreted by saying that the rats had cognitive maps of the 
areas they had explored. Those in the maze had a cognitive map 
of the various alleys, indicating which were blind and which led 
on to other choice points. Those in the roundabout path had a 
map that included not only the path but also the surrounding 
space with its possible shortcuts. If someone asks how it is
possible to have maps in the brain when the brain is made up of nerve
fibres that conduct impulses from one place to another, Tolman 
is not concerned. Cognitive maps are intervening variables, and 
if they explain learning Tolman does not care what connection 
they have with physiology.

Another of Tolman’s terms, less popular but more impressive,
is sign-gestalt-expectation. This is the individual’s expectation that
the world is organized in certain ways, that certain things lead to 
others. The inclusion of the word “sign” indicates that these
expectations are mainly about stimuli which are signs of certain
things, rather than about responses. The word “gestalt” emphasizes 
that the signs must be considered in context, that the whole pat- 
tern of stimulation is important. This term has become one of Tol- 
man’s trademarks; in fact his theory is sometimes called sign- 
gestalt theory. 

These two terms point up the closeness of Tolman’s formulation
to those of the cognitive theorists we have already discussed.
In many respects he was particularly close to Lewin, and he
borrowed heavily from Lewin in later formulations of his theory.
Tolman’s later theoretical publications are full of diagrams of vectors
and valences. He also added to Lewin’s system of diagrams by
making a distinction between pragmatic performance vectors
and identification performance vectors. The pragmatic performance
vectors refer to the tendencies actually to go to various places
or to make other responses of acting on the environment. The
identification performance vectors refer to the tendencies merely to
observe things. They are concerned with exploring the situation, with
observation of the environment. Thus Tolman distinguishes
tendencies to observe the environment, either out of curiosity or in
order to decide what to do about it, from tendencies to take action
toward some part of the environment.


Six Kinds of Learning


We can see more of Tolman’s relation to other theorists if we 
look at the six different kinds of learning that Tolman dis- 


on some food would give rise to a demand for that food. 
The second kind of learning involves equivalence beliefs. These


it was worth Tolman’s trouble to separate this kind of learning 
from the forming of cathexes.

These first two forms of learning both depend on experiencing
reward (or punishment). This feature is not present, however, 
in the third kind of learning. This is the formation of field expect- 
ancies. These are cognitions about the way the world is laid out, 
about what leads to what. They are essentially the same as what 
Tolman earlier called sign-gestalt-expectations. Nearly all of
what we ordinarily call knowledge comes under this heading. 
When we learn the route from one place to another, or when we 
learn what tools can be used for what purposes, we are forming 
field expectancies. Cognitive maps are made up mostly of field 
expectancies. The paths and barriers of Lewin’s life space
represent field expectancies. These field expectancies develop through
experience with the objective world. Demands and rewards may 
influence them indirectly by changing the individual's focus of at- 
tention, but they have no necessary connection with either demands 
or rewards. Thus we might say that Tolman is a reinforcement 
theorist with regard to cathexes and equivalence beliefs, but not 
with regard to field expectancies. It is in this third kind of
learning that Tolman proclaims himself clearly as a cognitive
theorist. It is thus the heart of his system.

The fourth kind of learning consists of field cognition modes. 
These are different from anything we have discussed so far, and 
Tolman does little more than suggest that they exist. Field cogni- 
tion modes are ways of learning, or biases toward learning cer- 
tain things more readily than others. To some extent, of course, 
these are innate, but they are also to some extent learned. Probably 
the most striking example of a field cognition mode is our tendency
to use language in a tremendous variety of learning situations.
When a human learns a maze in the laboratory, he is likely to learn it as
a verbal sequence of lefts and rights. This use of language makes
the process of maze learning different for humans from what it
is for animals. The learning of the maze is itself a matter of
forming field expectancies, whether the learning is animal or human.
The way in which it is learned, however, depends on field
cognition modes. The original learning of language as a means for
learning other things was therefore the learning of a field
cognition mode. The differences in IQ scores and in personality among
people raised in different cultures are partly attributable to
differences in field cognition modes, since these produce differences in
what people will learn from the same (objectively speaking)
experience.

The fifth variety of learning in Tolman’s system is drive dis- 
crimination. This is the ability to distinguish among different 
drives. For example, there is some evidence that animals have to 
learn the difference between hunger and thirst. This learning is 
of course closely related to the learning of cathexes, in which
demands are related to goal objects. This appears to be a rather trivial
category, and there is some question as to why Tolman bothered 
to treat it as separate from other forms of learning. 

The sixth and final kind of learning is motor patterns. This is 
essentially Tolman’s concession to, of all people, Guthrie. Guthrie 
had criticized Tolman’s system as concerned only with the individ- 
ual’s cognition that certain acts would obtain certain goals, not 
with the acts themselves. Guthrie satirized this aspect of Tolman’s 
system as leaving a rat in a maze “buried in thought.” He claimed 
that if the rat ever got to the food, this was the rat’s business, not 
the business of Tolman’s theory. Tolman, open to suggestions from 
his critics as well as from his supporters, took account of this ob- 
jection in listing the different kinds of learning. The muscular 
skills by which one actually gets to goals were never of much con- 
cern to Tolman’s study of molar behavior, but he granted that they 
would have to be considered in a complete theory of learning. 
How does one learn to ride a bicycle without falling, to fire a rifle 
accurately, or even to walk? Tolman is not really concerned with 
answering these questions, and does not feel obliged to fit the topic 
closely into his system. He suggests that Guthrie’s analysis in 
terms of stimulus-response connections learned by contiguity may 
be appropriate for this kind of learning. 

This list of the six kinds of learning shows Tolman’s borrowing 
from Freud, Guthrie, and the reinforcement theorists. We have al- 
ready referred to his use of Lewin’s diagrams. This breadth is also 
reflected in the variety of variables that he took into account in 
explaining behavior. In addition to all the aspects of the situation, 
he listed four main kinds of individual-difference variables. Only
one of these, we may note, involves learning. The four are
heredity, age, training, and endocrine, drug, or vitamin conditions.
Using the initial letters of these four, Tolman labeled them the 
HATE variables. (One wonders how Tolman, a devout Quaker, 
came to arrive at this particular set of initials.) The HATE varia- 
bles help to illustrate the breadth of Tolman’s concern with dif- 
ferent aspects of behavior. Discussions of such diverse factors 
as age and vitamin conditions are rare in learning theory. They are 
valuable as a reminder that learning, for all its great importance, 
is far from being the only determinant of behavior. 


Of Rats and Men 

Tolman is probably unique among cognitive theorists in having
given more attention to the behavior of animals than of humans.
(This was also Köhler’s emphasis at one stage of his career,
but he also did extensive work in human perception.) To some 
extent the preference reflects the same factors that have influenced 
behaviorists in general toward animal work: the greater simplicity 
of animals and the greater opportunity to control their environ- 
ments. It probably also reflects Tolman’s special concern to show 
that a cognitive system could still be objective, that it need not de- 
pend at all on anything the individual says. Tolman himself, in the 
tongue-in-cheek manner so characteristic of his writing, presented 
a third possible explanation. He suggested that psychologists typi- 
cally begin with an interest in solving the great problems of human 
life, then become frightened by the awesome implications of 
such a task and flee to safer aspects of the study of behavior, such 
as learning in the rat. Certainly Tolman is right in saying that many 
psychologists are aware of a conflict between their desire to at- 
tack the great questions of life directly and their desire to concen- 
trate on other questions that are less exciting but that they have 
more chance of being able to solve. Tolman’s own career reflects 
this conflict clearly. 

On the one hand, Tolman was a man of strong social conscience. 
(He was one of the faculty members who left the University of 
California rather than sign the controversial loyalty oath in 1950.) 
In 1942 he published a small book called Drives toward War, in
which he analyzed the psychological causes of war and presented
some suggestions for removing them. His analysis of the biological 
drives, social techniques, and psychological dynamisms leading 
to behavior, warlike or otherwise, combined experimental, 
clinical, and historical sources. On the basis of this analysis he sug- 
gested several fairly radical changes in our political, economic, 
and educational systems, changes which he thought would reduce 
the impetus toward war. Among these was a plan for a world 
state, along with some psychological suggestions for making this 
admittedly visionary idea a bit more practical. 

On the other hand, Tolman had as his principal ambition the 
development of a schematic rat world, a system from which 
he could predict completely the behavior of rats in a laboratory 
environment. This is reflected in his whimsical dedication of
Purposive Behavior to Mus norvegicus albinus (actually the white 
mouse, but he meant the white rat). He wanted to include enough 
different variables and to have flexible enough constructs to achieve 
this goal completely. In the presence of the tremendously complex 
human social situations that the layman usually wants the psy- 
chologist to predict for, this seems like a very modest ambition. He 
did not, however, achieve his goal. 


Evaluation of Tolman

If we consider how Tolman combined the best of both con- 
nectionist and cognitive theory, what a broad range of variables 
he took into account, and how early he anticipated later develop- 
ments in the logic of theory building (the use of intervening vari- 
ables), we could easily conclude that Tolman is the greatest learn- 
ing theorist we have considered. In its conception, his theory may 
very well be the best there is. The conception, however, was never 
really carried into execution. Tolman discussed the kinds of laws 
psychology needs, but he did not develop these laws. He did
experiments intended to show that cognitive formulations are better
than connectionist ones, but he did not do experiments to make
these cognitive formulations precise enough to be really useful for 
prediction. He pointed the way toward an extension of cognitive 
theory to include the best aspects of connectionist theory, but he 
did not carry the program through. He is therefore open to the
same criticism as the other cognitive theorists, that he does not
give us a basis for predicting from objective stimuli to objective 
behavior. He gave us a cognitive framework (or perhaps we 
should say a gestalt) for interpreting learning, but he did not pro- 
vide us either with detailed laws of learning like Skinner’s or with 
a general principle of learning like Guthrie’s. Thus his system is 
more a road sign or a pious hope than it is an accomplished fact. 

This shortcoming is not really a failure on Tolman’s part, how- 
ever, for his attitude toward his own theorizing was always tenta- 
tive and whimsical. He regarded theory-construction not as a 
serious business of building great intellectual edifices for the fu- 
ture but as a half playful trying out of different approaches. As a 
result he was constantly revising his theory, making suggestions 
and then abandoning them before he had time to explore their 
implications. His writings are a mixture of earnestness and 
whimsy, of high-flown theorizing constantly being pulled up short 
by a chuckle. His desire to construct a schematic rat world was 
thus partly an aspiration but also partly a joke. He wanted not 
so much to build a truly adequate theory as to explore the whole 
activity of theory building, both playing with it himself and punc- 
turing the excessive claims of others. As a result, he has been 
widely respected and widely loved, but not widely followed. 

Tolman died in the same year as Guthrie, and his final theoreti- 
cal statement appeared in the same book as Guthrie’s (Tolman, 
1959). He ended his chapter with a statement of what he had tried 
to do in his theorizing, a statement that stands as a sort of final
summary of his career in psychology. He admitted some doubt as to
how well his work would stand up to the canons of scientific proce- 
dure, but indicated that standing up to canons was not what mat- 
tered most to him. He had followed “his own gleam and his own 
bent,” which is the most that any scientist can do. And, most im- 
portant, he had had fun. 

Tolman’s system, incorporating as it does ideas from many 
sources, is the outstanding attempt to develop a cognitive interpre- 
tation of learning that would have the virtues of connectionist 
theories without their limitations. Though it has much to recommend
it as the outline for a complete theory of learning, it has
failed to achieve a wide and lasting following. Tolman himself
wondered, in his final statement, if the system might be too out
of date to deserve such a careful reconsideration. MacCor- 
quodale and Meehl (1953) had presented a more formal version 
of some aspects of Tolman’s theory, but this attempt did little to 
provoke new research or theoretical work. For the currently ac- 
tive attempts to combine the best features of cognitive and con- 
nectionist theory, we must look elsewhere. These current attempts 
have come from connectionists trying to expand and liberalize 
their theories, rather than from primarily cognitive theorists. They 
have developed for the most part in connection with the formal 
connectionist theories, and should be considered in that context. 
It is to these formal connectionist theories and their relations to 
cognitive theory that we turn in the next two chapters. 




A MAJOR FORMAL 
CONNECTIONIST THEORY 


THE MOST AMBITIOUS and most famous of the formal connectionist 
learning theories is that of Clark L. Hull (1884-1952). Long a pro- 
fessor at Yale, Hull probably had more effect on the psychology 
of learning than any other theorist in the past thirty years. He was 
a behaviorist in the Watsonian tradition, but far more sophisti- 
cated than Watson in the philosophy of science. His early training, 
before he became a psychologist, was in engineering, and some- 
thing of the engineer's outlook is evident in his desire to construct 
an elaborate, formal, precise structure of psychological theory. 


The Postulational Method of Theory Construction 

Hull’s concept of the ideal theory was a logical structure of pos- 
tulates and theorems, similar to Euclid’s geometry. The postu- 
lates would be statements about various aspects of behavior. 
They would not be laws taken directly from experiments, but 
more general statements about the basic processes involved. Like 
the postulates of geometry, they would not themselves be proved 
but would be taken as the starting points for proofs. From these 
postulates, a great variety of other statements, called theorems, 
could be logically derived. Each theorem could be proved by ar- 
guing logically from some combination of postulates. These theo- 
rems would be in the form of laws of behavior. 

So far, such a theory is simply a logical creation. We have said 
nothing about whether its statements are true or false, only about
how they hang together logically. When a theorem is proved, all
this means is that if the postulates are true, the theorem must also 
be true. In order for the theory to have any value as a descrip- 
tion of the real world, it is necessary to compare the theorems with 
actual laws of behavior as determined by experiments. In other 
words, after the theorist has determined by logic that the theorems 
follow from the postulates, he must then determine by experiment 
whether they are true. If they are, the whole theory is supported; 
if not, the theory is weakened and requires revision.

To anyone not acquainted with the philosophy of science, this 
approach may seem backward. The theorist starts with postulates 
that may or may not be true. He then proves logically that if 
the postulates are true, certain theorems must also be true. 
Next he determines by experiments whether each theorem is in 
fact true. Finally, he uses the truth or falsity of the theorems 
to argue indirectly about the truth or falsity of the postulates. If 
a theorem turns out to be false, he knows that at least one of the 
postulates from which he proved the theorem must also be false, 
since it led logically to a false conclusion. In consequence, some 
postulate (the theorist may not yet know which one) must be 
changed so that the theorems which follow from it will be true. If, 
instead, all the theorems turn out to be true, this verification
increases his confidence that all the postulates are true. However,
he can never be absolutely sure that the postulates are true, since 
false postulates can sometimes lead to true theorems. As more 
and more theorems turn out to be true, he becomes more and more 
confident that the postulates are true, but there is always the
possibility that eventually a theorem derived from the postulates will
turn out to be false.
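
The asymmetry described here, that a false theorem condemns some
postulate while a true theorem proves none, can be checked
mechanically. The following is a minimal sketch under stated
assumptions: two postulates and one theorem stand in for a whole
theory, and the function names (implies, valid, refutation,
blame_first, confirmation) are illustrative inventions, not part of
Hull's system. Each argument form is tested against every
combination of true and false.

```python
from itertools import product


def implies(a, b):
    """Material implication: 'if a then b'."""
    return (not a) or b


def valid(argument, n):
    """An argument form is valid if it holds for every one of the
    2**n assignments of True/False to its n statements."""
    return all(argument(*values)
               for values in product([False, True], repeat=n))


# p1, p2 are postulates; t is a theorem proved from them together.

def refutation(p1, p2, t):
    # If the theorem follows from the postulates and the theorem is
    # false, then the postulates are not both true.
    return implies(implies(p1 and p2, t) and not t, not (p1 and p2))


def blame_first(p1, p2, t):
    # Same premises, but concluding that p1 in particular is false.
    return implies(implies(p1 and p2, t) and not t, not p1)


def confirmation(p1, p2, t):
    # If the theorem follows from the postulates and the theorem is
    # true, conclude that both postulates are true.
    return implies(implies(p1 and p2, t) and t, p1 and p2)


print(valid(refutation, 3))    # True: some postulate must be false
print(valid(blame_first, 3))   # False: logic alone cannot say which one
print(valid(confirmation, 3))  # False: true theorems do not prove postulates
```

Only the first form survives every case. This is why a failed
prediction forces a revision somewhere among the postulates, while
successful predictions can raise confidence but never give
certainty.
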


Although this theoretical 
cated and strange, it is a
do in many familiar 
cerned about the poo 


knows has a high IQ. She suspects that his difficulty may reflect

since fear is not directly observable. They can, however, be tested
indirectly. To test them, the teacher must first figure out how a 
student who had this fear and this reaction to fear would behave 
in certain situations. This is the process of deducing theorems 
from postulates. Thus, if the teacher’s “postulates” about the stu- 
dent are true, it should also be true that: (1) he will do better on 
test items if they are presented to him casually in conversation than 
if they are part of a test, (2) if given an opportunity to compete for 
a prize, he will refuse, and (3) if offered a chance to give a talk to 
the class on some topic that interests him, he will accept. The 
teacher may try each of these approaches and see what happens. 
These, then, are three experiments, each designed to test one of 
the teacher’s “theorems.” (We assume the design is sound.) If in 
each case the student does what the teacher predicted, her confi- 
dence in her interpretation (i.e., in her “postulates”) will increase. 
She cannot, however, rule out the possibility that all of his behavior 
is really due to some other combination of factors, and that even- 
tually he will do something that does not fit at all with her inter- 
pretation. She can become more and more confident that her inter- 
pretation is correct, but she can never be absolutely sure of it. 

If it turned out that her first two predictions were confirmed, 
but the third was not, she would know that her “postulates” were 
not entirely correct. She might decide, since he refused to address 
the class even when no competition was involved, that his fear was 
not merely of competition but of any situation in which he was 
threatened with failure or disapproval. She could then look for 
new situations in which to test this revised “postulate.” In this way 
her interpretation of his behavior would be a self-correcting process
of making and testing assumptions.

The teacher’s thinking in this example and Hull’s thinking in
constructing a scientific theory are of course markedly different
in scope and in logical formality. These are differences of degree, 
however, rather than of kind. The teacher was concerned with very 
narrow interpretations, involving the behavior of a single student, 
while Hull was concerned with very broad ones, involving the 
whole range of human and animal behavior. In addition, Hull 
wrote down his postulates in detailed form and tested the theorems
in controlled experiments, whereas the teacher merely developed
hunches and tested them in a rough-and-ready fashion.
Nevertheless, the two cases are basically similar enough to show 
the kind of reasoning that Hull followed in building his theory.

It should be clear from the above that Hull did not regard his 
theory as a final statement about the nature of learning. Rather, 
it was intended as a tentative formulation, always subject to re- 
vision to bring it in line with new data or new ideas. This tenta- 
tiveness was a necessary characteristic of the kind of system he 
wanted to build. A set of postulates from which many laws of be- 
havior are supposed to follow by strictly logical deduction has 
more chances of being wrong than does an informal system that 
makes fewer or less exact predictions. Watson’s and Guthrie’s the- 
ories are informal enough to leave considerable doubt as to just 
what they would predict in a particular situation. Skinner's and
Tolman’s systems are mainly systems of terms and methods for
gathering and describing data. Hull, however, intended to create a
theory specific enough so that it would be easy to see when it was
contrary to the evidence. A theory of this sort is practically certain
to be wrong in some respects when it is first formulated; hence it
must be open to change as these errors are discovered.

It was Hull’s plan to write three books expounding his theory. 
The first was to present and explain the postulate system. This
appeared in 1943 as Principles of Behavior. Though various revisions
of the postulate system have been published since then, this
book remains the cornerstone of Hull’s theoretical writing. The 
second book was to contain detailed derivations of individual be- 
havior in a variety of situations. This was published in 1952 as 
A Behavior System. It contains 133 theorems with their deriva- 
tions from the postulates and the evidence bearing on their correct- 
ness as laws of behavior. The topics with which these theorems 
deal are as diverse as discrimination learning, locomotion in space, 
and the acquisition of values. An invalid during the last few years 
of his life, Hull was barely able to finish this second major book. 
In the preface, he expressed his regret that the third book in the 
series, which was to present derivations of behavior in social
interactions, would never be written. Four months later he died.
Although his trilogy thus remains incomplete, it gives us a good
picture both of the grandness of Hull's conception and of the many
respects in which the actual system fell short of Hull’s aspirations.
It is possible that if Hull had lived longer and had been in better
health during his last years, many of the shortcomings of his
theory would have been corrected. As it is, we must judge both his
successes and his failures by his works as they stand.


HULL’S BASIC POSTULATES 


Hull does not make Skinner’s distinction between respondent 
and operant behavior. For him, as for Watson, Guthrie, Thorndike, 
and Miller, all behavior involves stimulus-response connections. 
A response is never simply emitted; it is always a response to a 
stimulus. However, Hull is more explicit than any of the above 
four regarding all the factors other than the stimulus that influ- 
ence the response. His postulates are concerned largely with the 
systematic presentation of these various other factors and their 
interrelationships. As might be expected, this system of postulates 
and theorems is much more complicated and technical than any 
of the other systems we have discussed so far. These qualities 
make it more difficult to capture the flavor of Hull’s writing in a 
brief account. We shall begin with a relatively nontechnical account 
of Hull’s system, followed by some examples of his attempts at 
precise deduction and quantification. 


The Four-Stage Analysis 

Like most theorists, Hull wanted to develop a system for pre- 
dicting the dependent variables of behavior from various inde- 
pendent variables. Recognizing the variety of independent and 
dependent variables with which he had to deal, he tried to simplify 
the task of prediction by introducing intervening variables. We saw 
in Chapter 1 how the use of such intervening variables makes it pos- 
sible to summarize a great many details with a small number of 
concepts. Hull organized his intervening variables into a four- 
stage predictive scheme. The first stage consisted of the independ- 
ent variables from which he was predicting, the fourth stage of 
the dependent variables to which he was predicting, and the second 
and third stages of intervening variables connecting them. Knowing
the values (amounts or degrees) of the independent variables,
he could compute the values of the intervening variables at stage
2. From these he could in turn compute the value of the intervening
variable at stage 3, and from that finally predict the values
of the dependent variables. The general outline of this elaborate
scheme is given in Fig. 5, which may serve as a guide to the
explanation below.

Fig. 5. A schematic representation of Hull's basic system. The
independent variables influence the intervening variables, which in turn
influence the dependent variables. Some elaborations of this simplified
scheme are discussed later in the text.


The independent variables include all those that can be directly 
manipulated by an experimenter. These may be summarized 
briefly, since they are discussed in more detail in connection with 
the intervening variables. Some of the independent variables re- 
fer to the stimulation the learner is receiving at the moment, such as 
the brightness of a signal light or the intensity of electric shock. 
Others refer to immediately preceding events, such as the number 
of hours since his last meal or the amount of muscular effort he 
has recently put forth. Still others refer to previous experience in
the same learning situation, such as the number of times he has
previously made the response to be learned or the magnitude of 
the reward that he received the last time he made the response. 
The number of such independent variables that could be men- 
tioned is endless; the important thing for our understanding of 
Hull is the way that he has organized them in his system. 


Habit Strength and Drive 

The second stage of the analysis introduces intervening vari- 
ables. These are hypothetical states of the organism that cannot be 
observed but that are assumed to be directly controlled by the in- 
dependent variables. The two most prominent ones, habit strength 
and drive, differ only in detail from the corresponding items 
in Miller's system of drive, cue, response, and reward. Habit 
strength refers to the strength of the learned connection between 
a cue or cues and a response, a connection built up through rein- 
forced practice. Drive, for Hull as for Miller, is an activated state 
of the organism, and a reduction in drive serves as a reward. 

Since Hull is a connectionist theorist, habit strength is a key 
concept in his system. It is the strength of the bond connecting a 
stimulus with a response. To indicate the nature of this connec- 
tion, habit strength is abbreviated sHr (pronounced SH R), 
the H standing for habit and the S and R subscripts standing for 
the stimulus and the response which the habit connects. This habit 
is a permanent connection, which can increase but cannot decrease
in strength. All long-term learning involves the formation
and strengthening of habits. Each time a response occurs in the
presence of a stimulus and this event is quickly followed by
reinforcement, the habit strength of this stimulus-response connection
increases. In this assumption (Postulate 3 in the final version
of his system), Hull resembles all other reinforcement theorists. 
Like Miller but unlike Skinner, he goes on to say that all reinforce- 
ment involves a reduction in the strength of a drive stimulus. The 
rate at which habit strength builds up with successive reinforced
responses (given by an equation in Postulate 4) follows the
well-known “law of diminishing returns,” so that each successive
reinforced response contributes less to sHr than the previous one.
Eventually a point is reached at which additional reinforced
responses contribute very little more to habit strength.
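
The shape of such a curve can be made concrete with a short sketch.
It assumes a negatively accelerated growth function of the form
sHr = 1 - 10^(-aN), where N is the number of reinforced responses.
The constant a = 0.03, the 0-to-1 scale, and the function names are
illustrative assumptions, not values taken from this text.

```python
def habit_strength(n_reinforced: int, rate: float = 0.03) -> float:
    """Habit strength (sHr) on a 0-to-1 scale after n reinforced responses.

    Assumed form: sHr = 1 - 10 ** (-rate * n). The rate is illustrative.
    """
    return 1.0 - 10.0 ** (-rate * n_reinforced)


def gain_from_trial(n: int, rate: float = 0.03) -> float:
    """Amount added to sHr by the n-th reinforced response alone."""
    return habit_strength(n, rate) - habit_strength(n - 1, rate)


if __name__ == "__main__":
    # Each later trial adds less than the one before it.
    for n in (1, 5, 20, 60):
        print(n, round(habit_strength(n), 3), round(gain_from_trial(n), 4))
```

Under these assumptions habit strength never decreases and never
quite reaches its ceiling, which matches the two properties described
above: permanence and diminishing returns.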

Drive is a temporary state of the organism, produced by dep- 
rivation of something the body needs or by painful stimulation. 
There are many different specific drive conditions, of which hun- 
ger, thirst, and pain are typical examples. Drives have two differ- 
ent functions. For one thing, each drive condition, such as hunger 
or thirst, produces a characteristic drive stimulus. This stimulus 
indicates the particular need from which the body is suffering. A 
rapid reduction in this drive stimulus is reinforcing. As a result, 
any response that occurs just before a reduction in a drive stimu- 
lus tends to be learned as a response to whatever stimuli are pres- 
ent. (We have already seen this relationship in connection with 
the postulate about the building up of habit strength.) The other 
function of drives is an activating or energizing one. All drive con- 
ditions combine to make up the total drive level of the organism. 
This total drive (abbreviated D) serves to raise the individual's ac- 
tivity level. This activating effect of D can be seen both in an in- 
creased level of general body activity and in the increased vigor 
with which all learned habits are performed. This analysis is essen- 
tially the same as Miller’s (not surprisingly, since Miller based his 
on Hull’s), but somewhat more detailed as well as more precisely 
stated. 


Incentive Motivation 


Unlike either Skinner or Miller, Hull was concerned with the 
size of the reward used as a reinforcer. We are all familiar with the 
law that people tend to work harder for a larger reward than for a 
smaller one. In other words, the level of performance is commonly 
higher when a larger reward is given after the response occurs. 
How should this law be handled by the theory? In his 1943 book, 
Hull treated the magnitude of the reward as one aspect of rein- 
forcement. The larger the reward, the greater the reduction in 
drive, and hence the greater the increase in habit strength. How- 
ever, this treatment proved not to be satisfactory. Experiments 
showed that changes in the magnitude of the reward produced very 
rapid changes in the level of performance, much faster than the slow
growth of habit strength could explain. This difference was
particularly difficult for Hull to explain when the change was from a
larger to a smaller reward. If the reward was made smaller and 
performance became poorer, this result seemed to mean that the 
habit strength had become less. Yet habit strength involved a per- 
manent bond, one that became stronger with reinforced practice 
but never became weaker. Here was a striking example of a dis- 
crepancy between theory and experiment. How would Hull deal 
with it? 

Hull’s answer, in the later versions of his theory, was to intro- 
duce a third intervening variable at the second stage of the analy- 
sis. Along with habit strength and drive he added incentive moti- 
vation (abbreviated K). The magnitude of the reward, in the new 
version, affects only K, not habit strength. The level of K depends 
on the size of the reward on the few immediately preceding trials. 
When the size of the reward is increased, at whatever stage of prac- 
tice, K increases, and when the reward is decreased, K decreases. 
Incentive motivation thus refers, as the name implies, to the mo- 
tivating effect of the incentive that is provided for making the re- 
sponse. The distinction between incentive motivation and rein- 
forcement is subtle, since both depend on the reward, but the dif- 
ference is none the less important. In order for a habit to increase 
in strength, the response must be followed by a reward that re- 
duces a drive stimulus. The size of the reward makes no differ- 
ence to the rate at which habit strength is built up; any reinforce- 
ment is as good as any other. The size of the reward does, how- 
ever, affect the level of incentive motivation. Large rewards make 
for higher values of K, smaller rewards for lower values. When 
a rat learns to run down an alley for food, we see a gradual in- 
crease in its speed, an increase resulting from reinforced practice. 
However, over and above this slow increase in speed, we can in- 
crease or decrease the speed rapidly by changing the size of the 
food pellet we give at the end of each run. In the same way, it may
be possible to increase a worker’s output immediately by offering
him more pay for each item produced (thus increasing K), whereas
it would take much longer to increase his output by training him
in better work methods (increasing habit strength).


Excitatory Potential


These three intervening variables, sHr, D, and K, work together
to produce another intervening variable, which constitutes the
third stage of the analysis. This is called excitatory potential and
refers to the total tendency to make a given response to a given
stimulus. It is abbreviated sEr, according to the same principle
as sHr. It is equal to the product of the three other intervening
variables; in other words, sEr = sHr × D × K. This equation means
that the tendency to make a given response to a given stimulus
depends on a habit built up through reinforced practice
(sHr) and on two motivational factors, one depending on an
internal state (D) and the other on an external incentive (K). For
everyday purposes, one might refer to the sHr for a given response
as “knowing how to do it,” to the K as “knowing what is
to be gained by doing it,” and to D as “wanting the thing that is
to be gained.” Hull, however, would not use this terminology, as
he would consider it too vague and cognitive for an objective
science of behavior. For Hull, sHr, D, and K are defined by the
operations that produce them (reinforced practice for sHr,
deprivation or painful stimulation for D, nutritive substance for
K), not by any vague, everyday names that we might choose to
give them.
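
The multiplicative rule can be illustrated with a few lines of code.
This is only a sketch: the numerical scales, the particular values,
and the names are assumptions chosen for illustration, and treating
a satiated animal as having a drive of exactly zero is a
simplification.

```python
def excitatory_potential(habit: float, drive: float, incentive: float) -> float:
    """sEr = sHr * D * K, the product rule stated in the text."""
    return habit * drive * incentive


# Illustrative values on arbitrary scales (assumptions, not Hull's units).
well_practiced = 0.9
hungry, satiated = 1.0, 0.0
large_reward, small_reward = 1.0, 0.4

strong = excitatory_potential(well_practiced, hungry, large_reward)
after_reward_cut = excitatory_potential(well_practiced, hungry, small_reward)
no_drive = excitatory_potential(well_practiced, satiated, large_reward)
```

Because the three factors are multiplied, a zero value for any one of
them removes the response tendency entirely, however strong the
others may be. A cut in the reward likewise lowers sEr at once even
though the habit itself is unchanged.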


The Dependent Variables 


The fourth and final stage of Hull’s analysis is made up of the 
dependent variables, the aspects of behavior that can actually be 
observed and measured. Hull relates three of these to excitatory 
potential (sEr): (1) the amplitude or size of the response, (2) the 
speed of the response, and (3) the total number of responses that 
will occur, after reinforcement is removed, before extinction is 
complete. As sEr increases, amplitude, speed, and number of
responses to complete extinction all increase.

What Fig. 5 shows, and what we have discussed, is only the 
barest outline of Hull’s (1952) system. The final version of his 
theory had 17 postulates and 15 corollaries (theorems that follow 
immediately from a single postulate), which he hoped would cover 
the whole range of known behavioral phenomena. For example,
he postulated the existence of unlearned stimulus-response
connections, but did not elaborate the topic and rarely referred to
such innate connections in his theorems. He discussed the effect
of a delay between the response and the reinforcement in
weakening the response tendency. He went into considerable detail
about generalization and about the way generalized habit strengths 
combine (i.e., what happens when the same response is learned to 
two different but related stimuli). These details are beyond the 
scope of this book. 


OTHER POSTULATES AND COROLLARIES 


A number of elaborations, however, deserve fuller discussion 
here. One of these is the concept of threshold. This is a minimum 
value that excitatory potential (sEr) must reach before any observ- 
able response will occur. In other words, a very small value of 
sEr for a given response will not result in a very weak or very 
slow response, but in no response at all. Only when the excitatory 
potential reaches a level greater than the threshold value will 
any response occur. Hull took the concept of threshold from the 
study of neurophysiology and of sensation and applied it to the 
field of learning. It implies that in many cases there will be an
initial period of learning experience in which apparently no
learning is taking place. Learning is occurring, but it is not yet
apparent in behavior. This is because the combination of sHr × D
× K gives a value of sEr smaller than the threshold value. An
increase in either sHr or D or K might be enough to raise sEr above
threshold. If so, the learned response would then occur on the next
presentation of the stimulus.
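
The idea of a silent initial period can be sketched as follows. The
growth curve for sHr, its rate constant, the threshold value, and the
function name are all illustrative assumptions. Only the product
rule and the notion of a threshold come from the account above.

```python
from typing import Optional


def first_trial_with_response(drive: float, incentive: float, threshold: float,
                              rate: float = 0.03,
                              max_trials: int = 500) -> Optional[int]:
    """Return the first reinforced trial on which sEr exceeds the threshold.

    Returns None if the threshold is never exceeded within max_trials.
    """
    for n in range(1, max_trials + 1):
        habit = 1.0 - 10.0 ** (-rate * n)  # assumed growth curve for sHr
        if habit * drive * incentive > threshold:
            return n
    return None


if __name__ == "__main__":
    print(first_trial_with_response(drive=1.0, incentive=1.0, threshold=0.5))
    print(first_trial_with_response(drive=2.0, incentive=1.0, threshold=0.5))
```

With these particular numbers the response first appears on trial 11,
although habit strength has been growing from the first trial.
Doubling the drive brings the first response forward to trial 5,
which illustrates how a change in D alone can lift sEr above
threshold.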

A second concept is oscillation. This refers to the assumption 
Hull makes that any given amount of excitatory potential is not an 
exact value but the average of a random distribution of values. 
When sHr and D and K have been multiplied together to give the
value of sEr, we still cannot predict with certainty what the
amplitude or speed of the response will be. The reason is that the
oscillation of sEr may have carried the excitatory potential
momentarily above or below the value computed. The oscillation in
the value of sEr is rapid and random (following approximately
a normal distribution). The fact that Hull found it necessary to
introduce this concept shows that his system, for all its
complexity, could not hope to achieve perfect prediction of behavior. An
element of random, unpredictable variation had to be introduced 
to make the system at all realistic. 

Oscillation implies that a given response may occur on one 
trial and not on the next, in spite of the fact that sEr is greater (due
to the one additional reinforcement) on the latter trial than on the
former. The reason is that the oscillation of sEr brought its
momentary value lower (and below the threshold) on the latter trial
than on the former, even though the average value of the possible 
range of oscillation was higher on the latter trial. Therefore it 
is possible in memorizing a series of items to get one right on an 
earlier trial and wrong on a later trial. Guthrie would explain the 
same phenomenon by pointing out that the exact stimulus situa- 
tion varies from moment to moment, so that the response can rea- 
sonably be expected to vary also. Since these momentary stimu- 
lus changes are both too slight and too complex to measure prac- 
tically, Hull prefers to talk about oscillation as a separate inter- 
vening variable rather than to assume that all oscillations in the 
response are caused by variations in the stimulus. 

When two incompatible responses have been learned to the 
same stimulus, so that two sErs are competing for expression, the
principle of oscillation may be used to predict which will occur
on any given trial. The oscillations of the two sErs are not
synchronized, so that when one sEr is momentarily high the other may be
high, medium, or low. If the ranges of oscillation of the two sErs
overlap, each response will occur on some trials. If the sEr for one
response is considerably stronger than that for the other, so that 
their ranges of oscillation overlap only slightly, the stronger will 
occur most of the time and the weaker only rarely. If the two are 
equally strong, so that their ranges of oscillation coincide, each 
will occur half of the time. All such situations involve the conflict 
between two response tendencies, and the degree of overlap of 
the two ranges of oscillation makes it possible to predict how of- 
ten each of the two competing responses will occur. 
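
One way to turn this reasoning into numbers is sketched below. It
assumes that each sEr oscillates independently and normally around
its computed value, with the same standard deviation for both
responses. These are simplifying assumptions made for the sketch
(a normal curve has no strict "range"), and the function name is
hypothetical.

```python
import math


def prob_first_response_wins(mean_a: float, mean_b: float, sd: float = 1.0) -> float:
    """Chance that the momentary sEr for response A exceeds that for B.

    Under the stated assumptions the difference between the two momentary
    values is normal with standard deviation sd * sqrt(2).
    """
    z = (mean_a - mean_b) / (sd * math.sqrt(2.0))
    # Standard normal cumulative probability, written with the error function.
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


if __name__ == "__main__":
    print(prob_first_response_wins(3.0, 3.0))  # equally strong tendencies
    print(prob_first_response_wins(6.0, 3.0))  # A considerably stronger
```

When the two tendencies are equally strong the function returns
one half, and as the stronger tendency pulls away from the weaker the
probability approaches, but never quite reaches, certainty.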

Third, there is the concept of reactive inhibition. This is a tend- 
ency not to repeat a response that has just been made. The 
amount of reactive inhibition resulting from a response depends
on the amount of effort required to make the response, so that
reactive inhibition is roughly equivalent to fatigue. Since this
tendency not to respond works against and tends to cancel the effects
of excitatory potential (sEr), it is subtracted from sEr to give the
net excitatory potential used in predicting the amplitude and
latency of responses. The abbreviation for reactive inhibition is Ir.
The fact that there is no subscript S indicates that the inhibition is
against any making of that response, regardless of the stimulus that
produces it. The total amount of Ir against a given response
increases each time the response occurs but decreases with the
passage of time. It is for this reason that subjects usually perform
better on a learning task involving a lot of effort if their practice
is distributed, with short periods of practice separated by short
periods of rest. These rest periods provide an opportunity for Ir
to dissipate. Hull also uses reactive inhibition to explain why
extinction occurs. Once reinforcement is removed, successive
responses no longer build up sHr but continue to build up Ir. As
a result, inhibition gradually overcomes sEr until no further
responding occurs. However, after a period of time the Ir dissipates
and the response again occurs, giving spontaneous recovery. 
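
The advantage of distributed practice can be sketched in a few lines.
The account above says only that Ir grows with each response and
decreases with time. The fixed increment per response, the
exponential form of the decay, and all the numbers below are
assumptions made for illustration.

```python
def inhibition_after_practice(n_responses: int, rest_between: float,
                              gain: float = 1.0, decay: float = 0.5) -> float:
    """Reactive inhibition (Ir) present just after the last response.

    Each response adds `gain` to Ir. Each unit of rest between responses
    multiplies the accumulated Ir by `decay` (assumed exponential loss).
    """
    ir = 0.0
    for i in range(n_responses):
        if i > 0:
            ir *= decay ** rest_between  # dissipation during the rest period
        ir += gain                       # build-up from making the response
    return ir


massed = inhibition_after_practice(10, rest_between=0)
distributed = inhibition_after_practice(10, rest_between=2)
```

With no rest between responses the inhibition simply piles up, while
with rest periods it levels off at a much lower value, leaving more
net excitatory potential available for performance.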

There is a derivative of reactive inhibition called conditioned 
inhibition. This is abbreviated sIr to indicate that it is a kind of
habit connecting stimulus and response. Conditioned inhibition
may be thought of as a habit of nonresponding produced by
reactive inhibition. We can think of Ir as a negative drive that can
be reduced by not making the response. Whenever the individual
who has been responding ceases to respond, this “response of
stopping responding” is reinforced by the fact that Ir starts to
dissipate. The habit of stopping responding that is learned in this way
is sIr. This sIr explains, among other things, why spontaneous
recovery is not complete. The Ir built up during extinction dissipates
with time, but the sIr learned in the process remains. It is
noteworthy that Hull does not treat Ir and sIr consistently as a drive
and a habit, respectively. In his equation he simply subtracts both 
Ir and sIr from sEr. 
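
The subtraction, and the reason spontaneous recovery is incomplete,
can be shown with a small numerical sketch. All the values are
invented for illustration, the threshold is taken to be zero, and
the rest period is assumed to be long enough for Ir to dissipate
completely.

```python
def net_potential(excitatory: float, reactive: float, conditioned: float) -> float:
    """Net excitatory potential: sEr minus Ir minus sIr."""
    return excitatory - reactive - conditioned


S_E_R = 5.0  # excitatory potential built up during training (illustrative)

before_extinction = net_potential(S_E_R, reactive=0.0, conditioned=0.0)
end_of_extinction = net_potential(S_E_R, reactive=3.5, conditioned=2.0)
after_rest = net_potential(S_E_R, reactive=0.0, conditioned=2.0)
```

At the end of extinction the net value has fallen below zero, so no
response occurs. After rest the temporary Ir is gone and the response
returns, but the permanent sIr keeps the net value below its
original level.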

A fourth elaboration of Hull’s theory is found in two corollaries 
concerning secondary reinforcement and secondary drive. These 
are closely related to Skinner’s concepts of conditioned positive
reinforcers and conditioned negative reinforcers, respectively.
When a neutral stimulus (one that does not act either as a drive 
or as a reinforcer) is paired with a sudden reduction in a drive 
stimulus, the neutral stimulus tends to acquire reinforcing proper- 
ties. For example, if a rat is frequently fed in a white box, the box 
will become a secondary reinforcer. It can then be used, even with- 
out food, to reinforce some new response. If the rat, still hungry, 
is put in a situation where turning right will bring him to the empty 
white box and turning left will bring him to an empty black box, 
he will learn to turn right. The reinforcement for turning right is 
not drive reduction (primary reinforcement) but secondary re- 
inforcement from the white box. Similarly, money becomes a sec- 
ondary reinforcer for humans because, even though worthless as 
an object, it makes possible so many rewards. 

Once a secondary reinforcer has been established through as- 
sociation with primary reinforcement, it is possible to use it to es- 
tablish another neutral stimulus as a secondary reinforcer. In 
principle, this process can be continued indefinitely, with a pri- 
mary reinforcer being used to condition a secondary reinforcer, 
that secondary reinforcer used to condition another, that one a 
third, and so on. In practice, however, this sequence is quite
difficult to accomplish, since the reinforcing properties of secondary
reinforcers tend to extinguish when primary reinforcement is no 
longer paired with them. 

Secondary drive is already familiar from Miller and Dollard’s 
discussion of the learning of fear. For once, Hull goes into less de- 
tail than they do. For Hull, secondary drive is simply the attach- 
ment of drive characteristics to a neutral stimulus through the 
pairing of the neutral stimulus with a drive stimulus. Once this
learning process has occurred, the secondary drive stimulus has 
the same effects on learning as a primary drive stimulus. Its pres- 
entation contributes to the individual's total drive level (D) and 
its removal is reinforcing. The outstanding example of a second- 
ary drive is fear. A neutral stimulus that is paired with pain soon 
acquires drive properties, and the drive it produces is called fear. 
According to Hull, however, a secondary drive can be learned 
through pairing a neutral stimulus with any primary drive. 

Since Hull assumed that all learning involves reinforcement, 
either primary or secondary, he faced a problem in stating the
corollary concerning secondary drive. If a secondary drive is
learned through the pairing of a neutral stimulus with a primary 
drive, where is the reinforcement in that learning situation? Hull 
answered this question by specifying that the learning experience 
must include not only the neutral stimulus and a drive stimulus but 
also a sudden reduction in the drive stimulus. In other words, the 
learning of a secondary drive requires that three events happen 
in quick succession: presentation of a neutral stimulus, an increase 
in a primary drive stimulus, and then a decrease in the drive stim- 
ulus. This specification saved the principle that primary reinforce- 
ment always requires drive reduction, but raised other problems. 
It implies, for example, that a person will learn to fear a stimulus 
paired with a short stab of pain but not a stimulus paired with an 
equally intense long-lasting pain, since in the latter case there is no 
reduction in the pain for some time after the pairing. A number 
of studies have tested this and other such surprising predictions, 
generally with results unfavorable to Hull’s assumption. The fact 
that Hull made such an implausible assumption rather than give 
up the principle that all learning involves reinforcement is an ex- 
ample of the difficulties that a concern with theoretical consistency 
can cause. 

A final corollary that must be considered is concerned with the 
fractional antedating goal reaction. This imposing expression can
best be understood by analyzing it in parts, starting at the end. 
A goal reaction is a response of consuming a reward, which by 
definition is at the goal of a sequence of responses. Thus, when a 
rat eats food in the goal box at the end of a maze, it is making a 
goal reaction. An antedating goal reaction is one that occurs before 
the individual gets to the goal and involves an anticipation of the 
goal. (In an earlier version of his theory Hull used the term “antic- 
ipatory goal response.”) However, this antedating goal reaction 
obviously cannot be a complete goal reaction, since there is no food 
to be consumed until the goal is actually reached. The rat in the 
maze can anticipate the food by licking its chops and salivating, 
but cannot make the complete eating response until food is avail- 
able. These parts of the goal reaction that can occur before the goal 
is reached are therefore called fractional antedating goal reac- 
tions. 

The abbreviation for fractional antedating goal reaction is rg.
This is usually read as “little r G,” which is a sort of pun on the
word “little.” It indicates both that the written letter r is lowercase 
and that the goal reaction referred to is fractional and hence small. 

How is the rg acquired? Like any other response, says Hull:
by reinforced practice. When a rat eats food in the goal box of
a maze, it is making goal responses that are reinforced by 
the drive reduction from the food. This goal reaction occurs in the 
presence of stimuli from the goal box, such as its color, smell, and 
floor texture, and thus is learned as a response to these stimuli. As 
a result there will be a tendency for the goal reaction to occur in 
response to these stimuli even in the absence of food. However, 
the stimuli in the rest of the maze are much like those of the goal 
box, so that there will also be a tendency for the goal reaction to 
occur to these stimuli while the rat is in the maze but not yet in the 
goal box. Since such an antedating goal reaction can only be 
fractional in the absence of food, it is the rg. 

Although Hull limited his formal consideration of rg to such
obvious examples as fractional eating responses, the concept can 
be applied to any goal response. In the case of humans, there might 
be fractional antedating reading responses, baseball-playing re- 
sponses, or money-pocketing responses. These could be condi- 
tioned, respectively, to the stimuli of the library, the sand lot, and 
the pay envelope. 

The important aspect of the rg is that it produces stimuli which
have a major role in guiding the individual’s behavior. A stimulus
produced by an rg is called an sg. The function of sg in Hull’s
system is much like that of Guthrie’s movement-produced stimuli.
The chief difference is that the sg in Hull's system has reinforcing
properties, which would of course not be included in Guthrie’s
system. The reinforcing properties of sg arise from the fact that these
stimuli were closely paired with drive reduction during the
original goal reaction (e.g., eating). This pairing resulted in their
becoming secondary reinforcers. As a result, any stimulus that
produces rg, and hence also sg, will have a secondary reinforcing effect.


DERIVATION OF THEOREMS 


We have said that Hull’s goal was to derive the laws of behavior 
logically from a simple system of postulates. After considering the
number of concepts and assumptions that Hull packed into his 17
postulates, the reader may not think that there is anything simple 
about the system. However, its relative simplicity is more evident 
if we note that A Behavior System contains 133 theorems, some 
with several parts, all following from the 17 postulates. Moreover, 
these 133 represent the limitations of Hull’s time and energy rather 
than the limitations of the theory—there could have been many 
more. There is no question that the system represents an accom- 
plishment of major proportions. 


Discrimination Learning 


Many of the derivations of theorems are quite complex, requiring
a careful working-through of mathematical relationships. For
our purposes, however, a nonmathematical treatment of one of 
the simpler derivations will be enough to illustrate Hull’s ap- 
proach. Theorems 3 and 4 deal with a special case of discrimina- 
tion learning. In general, discrimination learning requires that the 
individual learn to respond to one of two stimuli and not to 
the other. The discrimination might involve a pigeon learning to 
peck at a red key but not at a green one, or it might involve 
a teacher learning to be firm with her pupils but not with the school 
administrators. In the special case considered in Theorems 3 and 4, 
the tendency to respond to one stimulus is much stronger at the 
beginning of training than the tendency to respond to the other, 
but only responses to the latter stimulus are reinforced during 
training. Learning the discrimination means that the individual 
must learn to respond to the stimulus which at first has a weak ex- 
citatory potential and not to respond to the one which initially has 
a high excitatory potential. The theorems predict the shape of the 
learning curve in this situation. 

The derivation of the theorems goes as follows: Each time the 
correct response occurs, its habit strength and hence its excitatory 
potential increase. Each time the incorrect response occurs, con- 
ditioned inhibition is built up (Hull ignores reactive inhibition 
here, assuming distributed practice) and the net excitatory poten- 
tial (sEr minus sIr) for that response is reduced. However, the 
two stimuli are somewhat similar (otherwise there would be no 
discrimination to learn), and generalization occurs between them.
When the individual is reinforced for responding to the correct
stimulus, this reinforcement increases not only the habit strength 
for the response to that stimulus but also (to a much lesser ex- 
tent) the habit strength for the response to the other stimulus. Sim- 
ilarly, when a response to the incorrect stimulus occurs and is 
not reinforced, some of the conditioned inhibition generalizes to 
the correct stimulus-response connection and the net excitatory 
potential for that connection is reduced. 

It might seem at first glance that these two kinds of generaliza- 
tion would cancel each other out and make no difference. How- 
ever, it is necessary to take into account the relative number of 
times that responses to the correct and the incorrect stimuli occur. 
Early in learning, since there is much more sEr for responses to the
incorrect stimulus, responses to that stimulus will be much more
frequent. As a result, generalized inhibition from nonreinforced
incorrect responses will weaken the correct tendency faster
than reinforced correct responses can strengthen it. Consequently,
while sEr for responding to the incorrect stimulus is being rapidly
weakened, sEr for responding to the correct stimulus is also being
weakened, though much more slowly. However, since the sEr for
the incorrect stimulus is being reduced more quickly, the tendency
to respond to the correct stimulus is growing relatively stronger,
so the probability (i.e., the relative frequency) of responding to
the correct rather than the incorrect stimulus is increasing. The
oftener the learner responds to the correct stimulus, the oftener
he is reinforced for doing so and the more rapidly his sEr for doing
so increases. The less often he responds to the incorrect stimulus,
the less sIr generalizes from the incorrect stimulus to weaken the
sEr to the correct stimulus. For both of these reasons, sEr for the
correct stimulus increases at a faster rate as the learning goes on.

As the response to the correct stimulus gets stronger, three 
things happen. One is that habit strength for the correct stimulus 
begins to approach its upper limit, according to the principle of di- 
minishing returns, and so the excitatory potential for response to 
that stimulus increases less and less rapidly. Second, more and
more sEr generalizes from the correct to the incorrect stimulus, since
more responses are now to the correct than to the incorrect stimulus.
Third, since there are fewer responses to the incorrect stimulus,
sIr builds up less rapidly. The second and third of these events
both serve to keep the excitatory potential for responses to the in- 
correct stimulus from getting any weaker. Since the correct ten- 
dency is increasing in strength more slowly and the incorrect ten- 
dency is no longer decreasing, the probability of responding to 
the correct stimulus now increases more slowly than before. 

When we put all these considerations together, we can see that 
the probability of responding to the correct stimulus rather than 
the incorrect one increases slowly at first, then more rapidly, and 
then slowly again. This variation is shown graphically in the S- 
shaped learning curve in Fig. 6, which shows the strength of the 
response at each stage of learning. Learning curves with this 
shape are called sigmoid, from the Greek letter sigma. This sig- 
moid curve is different from the learning curve when the rein- 
forced response is originally the stronger, and also different from 
the theoretical curve for the building up of habit strength for a sin- 
gle response, given in Postulate 4. Both of these other curves rise 
most rapidly at first and gradually slow down as learning pro- 
gresses (Fig. 6). Thus the sigmoid learning curve is predicted for 
discrimination learning only in this special case where the correct 
connection is weaker in the beginning than the incorrect one, and 
it takes a fairly complicated line of reasoning to make the predic- 
tion. However, the prediction is consistent with experiments. In a 
number of different kinds of learning, it has been found that tasks 
on which learners do badly at first give sigmoid curves, whereas 
those on which the learners do fairly well at first give curves with 
rapid rises at the beginning. Thus Hull's prediction, specialized 
and technical though it is, is confirmed by experimental data. 
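
The verbal derivation above can be followed numerically with a short simulation. The sketch below is a toy model of an "average learner," not Hull's own numerical example. The starting strengths, the ceiling, the learning rate, the size of each increment of conditioned inhibition, the generalization fraction, and the use of a normal curve to turn the difference in net sEr into a probability are all assumptions chosen only for illustration.

```python
import math


def normal_cdf(z):
    """Cumulative probability of the standard normal distribution."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def simulate_discrimination(n_trials=60,
                            habit_correct=1.0,     # assumed weak initial tendency
                            habit_incorrect=4.0,   # assumed strong initial tendency
                            ceiling=6.0,           # assumed upper limit of habit strength
                            learn_rate=0.10,       # assumed growth rate per reinforcement
                            inhibition_step=0.25,  # assumed sIr added per nonreinforced response
                            generalization=0.20):  # assumed fraction that generalizes
    """Return (p_correct, net_correct, net_incorrect), one value per trial.

    Strengths are in arbitrary units, taken here to equal one standard
    deviation of oscillation (an assumption of this sketch).
    """
    inhib_correct = 0.0
    inhib_incorrect = 0.0
    p_curve, net_correct, net_incorrect = [], [], []
    for _ in range(n_trials):
        e_correct = habit_correct - inhib_correct        # net sEr, correct response
        e_incorrect = habit_incorrect - inhib_incorrect  # net sEr, incorrect response
        # Chance that the correct tendency is momentarily the stronger one,
        # treating the two oscillations as independent normal variables.
        p = normal_cdf((e_correct - e_incorrect) / math.sqrt(2.0))
        p_curve.append(p)
        net_correct.append(e_correct)
        net_incorrect.append(e_incorrect)
        # Expected changes, weighted by how often each response occurs.
        gain = learn_rate * (ceiling - habit_correct)    # diminishing returns
        habit_correct += p * gain                        # reinforced correct response
        habit_incorrect += p * generalization * gain     # generalized habit strength
        inhib_incorrect += (1.0 - p) * inhibition_step   # nonreinforced incorrect response
        inhib_correct += (1.0 - p) * generalization * inhibition_step  # generalized inhibition
    return p_curve, net_correct, net_incorrect


if __name__ == "__main__":
    p_curve, _, _ = simulate_discrimination()
    for trial in range(0, len(p_curve), 5):
        print(f"trial {trial:2d}   P(correct) = {p_curve[trial]:.3f}")
```

With these assumed values the probability of a correct response rises slowly, then quickly, then slowly again, and the net sEr for the correct response drops a little during the first few trials before it begins to climb, as the verbal argument requires. Different parameter values would change the details of the curve.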

In the derivation of Theorems 3 and 4, Hull did not merely 
present the verbal reasoning given here. He also worked through 
a numerical example, computing actual values of excitatory po- 
tential and conditioned inhibition and using these to determine the 
probability of a correct response for an average learner at each 
stage of practice. (Because of oscillation, he could not predict to 
any particular individual learner.) He was thus able to give, not 
merely the general shape of the learning curve, but exact proba-
bilities for the particular values of sEr that he used in his example.

Fig. 6. Two shapes of learning curves. Curve A is sigmoid, the shape
predicted by Hull for the case where the wrong choice has a greater sEr
at the beginning of training than the correct choice. Curve B is negatively
accelerated, the shape Hull predicted for the case where the correct
choice is initially the stronger. Curve B resembles the growth curve for
the sEr of a single response. (Horizontal axis: trials.)


Cognitive Predictions


In addition to numerical predictions like this one, Hull also
derived less exact theorems dealing with more complex forms of
learning. He was well aware of the challenge to connectionist the-
ory that was posed by Tolman, Lewin, and other cognitive psy-
chologists, and he wanted to show that the kinds of behavior these
cognitive psychologists studied could be shown to follow from
his theory. He therefore derived a number of theorems dealing
with movement in free space and with insightful problem solv-
ing. In deriving these, he made great use of the fractional an-
tedating goal reaction (rg). This proved to be a powerful tool for
interpreting insightful behavior in connectionist terms. When Tol-
man says an individual has a cognition that food is to be found
in a certain place, or when Lewin says that a certain region in the
life space has positive valence because it contains food, Hull
can say that the stimuli of this place elicit rg for food. Fractional
antedating goal reactions can thus be seen as cognitions about
the location of goal objects. In making such interpretations, Hull
was following the tradition of Watson and Guthrie, reducing ideas
and expectations to small movements of parts of the body. How-
ever, he went into more detail than Watson or Guthrie, and in
doing so reduced the gap between connectionist and cognitive
interpretations of learning.

By using rg, Hull could predict that an individual might ap-
proach a goal object by a route that he had never before used.
Once the individual saw the goal object, or any stimuli commonly 
associated with it, rg would be aroused by these stimuli. As a re- 
sult, the individual would tend to approach the goal object, re- 
gardless of whether the route by which he had approached it was 
one he had ever before used for approaching that goal. His behav- 
ior would thus conform to Lewin’s prediction, that he should tend 
to approach a region of positive valence from whatever direction 
and by whatever means the situation required. It would also fit Tol- 
man’s interpretation of behavior as a seeking of goals rather than 
a running off of habits. Approaching old goals by new techniques 
is thus one phenomenon, emphasized by cognitive psychologists, 
with which Hull is able to deal. 

Closely related to new approaches is the matter of detours. 
Much of the emphasis in gestalt interpretations of learning is on 
finding new ways to reach goals when the old familiar ways are 
blocked. The new ways involve finding detours around barriers, 
either in physical space or in Lewin’s topological life space. Ge-
staltists interpret such detour solutions as involving a restructur-
ing of the life space, a realization that it is possible to get around
the barrier. Hull concerns himself only with physical space and 
concentrates on the trial-and-error process by which the detour 
is discovered. The inexperienced individual, when confronted by 
a barrier, is likely to spend some time in futile attempts to pene-
trate the barrier and continue directly on his way to the goal. As
reactive inhibition builds up and partially inhibits this direct re- 
sponse, says Hull, the oscillations of this and other responses oc- 
casionally make some other response momentarily stronger. Even- 
tually this process results in the individual accidentally finding 
a route around the barrier. The response of taking this indirect 
route is reinforced and thus becomes more likely to occur if the in- 
dividual again encounters that barrier on the way to that goal. Hull 
derived a number of theorems about the factors that would make 
such detour learning easier or more difficult. This kind of learn- 
ing involves problem solving, but without insight. 

For an individual experienced in free space, however, the situa- 
tion is different. In the course of ordinary life, individuals often 
reach the same goal by different routes. Different habits leading 
to the same goal come to form a habit family. All members of the 
habit family have rg for a certain goal in common. When one of
these habits is blocked, another member of the habit family can 
readily take its place, so that progress toward the goal can continue. 
Those habits in the family that obtain the goal most quickly and 
easily are preferred, both because they involve less delay of rein- 
forcement and because they result in less reactive and conditioned 
inhibition. The habits in the family are thus arranged in a hier- 
archy of preference, ranging from the most to the least preferred. 
A less preferred habit will ordinarily be used only if all the more 
preferred ones are blocked. This arrangement of habits is known 
as the habit-family hierarchy. We might think of the habit- 
family hierarchy as Hull’s equivalent of a cognitive map. In mak- 
ing use of it to explain detour behavior, Hull is in effect saying that 
the ability to show cognitive behavior, to act as if one has a cogni- 
tive map, is the result of prior learning. He is reinterpreting cogni- 
tive behavior as a product of prior connectionist learning. 
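
The habit-family hierarchy can be pictured as a ranked list from which the most preferred unblocked member is chosen. The sketch below is only an illustration of that idea: the `Habit` class, the `delay` and `effort` numbers, and the rule of ranking by their simple sum are hypothetical stand-ins for delay of reinforcement and for the inhibition a route produces, not quantities taken from Hull.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional, Set


@dataclass(frozen=True)
class Habit:
    """One route to the goal; all members of the family share the same rg."""
    name: str
    delay: float   # hypothetical: delay of reinforcement on this route
    effort: float  # hypothetical: work on this route, standing in for inhibition


def rank_family(family: Iterable[Habit]) -> List[Habit]:
    """Order habits from most to least preferred (lowest delay + effort first)."""
    return sorted(family, key=lambda h: h.delay + h.effort)


def choose_habit(family: Iterable[Habit], blocked: Set[str]) -> Optional[Habit]:
    """Return the most preferred habit that is not blocked, or None."""
    for habit in rank_family(family):
        if habit.name not in blocked:
            return habit
    return None


if __name__ == "__main__":
    family = [
        Habit("long way around", delay=9.0, effort=6.0),
        Habit("direct path", delay=2.0, effort=1.0),
        Habit("short detour", delay=4.0, effort=2.0),
    ]
    print(choose_habit(family, blocked=set()).name)
    print(choose_habit(family, blocked={"direct path"}).name)
```

A less preferred habit is returned only when every more preferred one is in the blocked set, which is the ordering property the text describes.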

The addition of incentive motivation to Hull’s system provided 
him with another convenient tool for reinterpreting insightful 
learning in his own terms. One of the points that Tolman empha- 
sized was the distinction between learning how to get to a given 
place and learning whether to expect a reward there. In making 
K separate from sHr, Hull conceded this point. Learning how to 
get somewhere or do something is a matter of habit strength, which
builds up slowly. Learning where reward is located is a matter of
incentive motivation, which both increases and decreases rapidly. 
It is thus possible for sHr to reach a high level while K remains 
very low. In such a situation, the introduction of a large reward 
would raise K and thus result in a sudden improvement in per- 
formance. This response would constitute a form of latent learning, 
one of Tolman’s favorite phenomena, but it would also be consistent 
with the newer version of Hull’s theory. 
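
The effect of separating K from sHr can be sketched with the multiplicative rule sEr = sHr × D × K. In the sketch below, habit strength grows slowly with each reinforced trial while K moves quickly toward a level set by the size of the reward just received. The reward levels, the rate at which K adjusts, the drive value, and the trial on which the reward is changed are all hypothetical numbers chosen for illustration.

```python
def habit_strength(n_reinforcements, rate=0.0305):
    """sHr = 1 - 10^(-rate * N), using the constant cited in the text."""
    return 1.0 - 10.0 ** (-rate * n_reinforcements)


def simulate_reward_shift(n_trials=40, shift_trial=30, drive=1.0,
                          k_small=0.1, k_large=0.9, k_rate=0.7):
    """Return sEr on each trial when a small reward is replaced by a large one.

    k_small, k_large, k_rate, drive, and shift_trial are hypothetical values.
    """
    incentive = 0.0
    ser_by_trial = []
    for trial in range(1, n_trials + 1):
        habit = habit_strength(trial - 1)          # reinforcements received so far
        ser_by_trial.append(habit * drive * incentive)
        # The reward met on this trial sets the level toward which K moves.
        reward_level = k_small if trial < shift_trial else k_large
        incentive += k_rate * (reward_level - incentive)  # K changes rapidly
    return ser_by_trial


if __name__ == "__main__":
    ser = simulate_reward_shift()
    for trial in (10, 20, 30, 31, 32, 40):
        print(f"trial {trial:2d}   sEr = {ser[trial - 1]:.3f}")
```

Under these assumptions sEr creeps upward while the reward is small and then rises sharply on the trial after the large reward is first received, because the habit was already strong and only K had been holding performance down.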

These applications of Hull’s theory to complex, insightful phe- 
nomena of learning do not by any means close the gap between 
connectionist and cognitive interpretations. For one thing, some 
learning phenomena emphasized by cognitive psychologists are 
still largely beyond the reach of Hullian explanations. Buxton’s
latent learning experiment, described in our discussion of Tol- 
man, is very difficult to explain in Hullian terms, even with the aid of 
habit-family hierarchies and fractional antedating goal responses. 
For another thing, these derivations of Hull’s are less precise than 
many of his others, and there is room for some dispute as to how 
rigorously the theorems really follow from the postulates. Never- 
theless, Hull has gone far enough in closing the gap between the 
two kinds of theory to point the way toward a possible eventual 
amalgamation, in which cognitive principles would be accepted as 
valid but would be interpreted as the result of prior learning of a 
connectionist sort. Such an amalgamation is still far off, but not 
as far off as it once appeared. 


STRENGTHS AND WEAKNESSES OF THE SYSTEM 


Hull saw the value of his theory not so much in the particular 
intervening variables it included as in its rigorous quantifica- 
tion. The postulates do not merely state that certain variables are 
related; they give equations by which one can be computed pre-
cisely from the other. For example, the postulate regarding habit
formation contains the equation sHr = 1 - 10^(-.0305N), where N
is the total number of reinforcements. Using this equation, one can
state the exact level of habit strength for any given number of re-
inforcements. Similar equations are presented connecting the
other second-stage intervening variables to the independent vari-
ables and also connecting excitatory potential to the dependent vari-
ables. The equation connecting the second-stage intervening vari-
ables to sEr, which involves simple multiplication, has already
been given.
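
As a small worked illustration, the habit-strength equation can be evaluated directly. The sketch below assumes the form sHr = 1 - 10^(-.0305N), with the constant .0305 that the text cites for this equation; the function names and the inverse calculation are additions for illustration only.

```python
import math


def habit_strength(n_reinforcements, rate=0.0305):
    """Habit strength after N reinforcements: sHr = 1 - 10^(-rate * N)."""
    if n_reinforcements < 0:
        raise ValueError("number of reinforcements cannot be negative")
    return 1.0 - 10.0 ** (-rate * n_reinforcements)


def reinforcements_needed(target, rate=0.0305):
    """Invert the equation: N = -log10(1 - sHr) / rate, for 0 <= sHr < 1."""
    if not 0.0 <= target < 1.0:
        raise ValueError("target habit strength must be in [0, 1)")
    return -math.log10(1.0 - target) / rate


if __name__ == "__main__":
    for n in (0, 1, 5, 10, 20, 50, 100):
        print(f"N = {n:3d}   sHr = {habit_strength(n):.3f}")
    print(f"N for sHr = .90: {reinforcements_needed(0.90):.1f}")
```

Each additional reinforcement adds less than the one before, so the curve rises quickly at first and then levels off toward its upper limit of 1.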

Hull dedicated much of his effort in the last years of his life to 
the problems of quantification. He wanted to develop a scale of 
measurement for excitatory potential that would be applicable to 
any response. How can sEr for pressing a lever be made compar-
able to sEr for running down an alley? Hull’s answer (which
will be meaningful only to those readers who have studied
statistics) was to use the standard deviation of the oscillations of
sEr as a measuring unit. This use of the standard deviation as a scal-
ing unit is an idea Hull took over from the specialists in mental test-
ing. He could then say that sEr for a given response was two stand-
ard deviation units above zero, and this statement would have
the same meaning regardless of what response he was talking
about. This unit is also convenient for analyzing competition be-
tween responses, since knowing the sEr for each response in stand-
ard deviation units also automatically tells how much their ranges
of oscillation overlap and hence how often each will occur. Hull
regarded the development of this scaling unit as one of the lead- 
ing achievements of his life. 
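
For readers who have studied statistics, the way standard deviation units settle a competition between two responses can be sketched in a few lines. The sketch assumes, for illustration only, that each response's momentary sEr oscillates normally around its mean, that the two oscillations are independent, and that both have the same standard deviation (the scaling unit). These assumptions are not spelled out in the text, and the function names are hypothetical.

```python
import math


def normal_cdf(z: float) -> float:
    """Cumulative probability of the standard normal distribution."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))


def prob_first_response_wins(ser_a: float, ser_b: float) -> float:
    """Probability that response A is momentarily stronger than response B.

    Both arguments are mean sEr values in standard deviation units. Each
    oscillates with SD 1 (by definition of the unit), so the difference of
    two independent oscillations has SD sqrt(2).
    """
    return normal_cdf((ser_a - ser_b) / math.sqrt(2.0))


if __name__ == "__main__":
    for a, b in ((2.0, 2.0), (2.5, 2.0), (3.0, 1.0)):
        p = prob_first_response_wins(a, b)
        print(f"sEr A = {a}, sEr B = {b}:  P(A occurs) = {p:.3f}")
```

Two responses with equal sEr each occur half the time, and the farther apart their values lie in standard deviation units, the less their ranges of oscillation overlap and the more often the stronger one occurs.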

The evaluation of Hull’s work by others has, however, been 
rather different from his own evaluation. It is just on the matter of 
quantitative rigor that he is most vulnerable to criticism. To some 
extent the reason is that his attempts at quantification were pre-
mature. The exact values in his equations, such as the -.0305
in the equation for habit strength, were typically based on the re-
sults of a single experiment. He made the suggestion, but never
developed it, that the values in the equations might vary from in-
dividual to individual. Thus different people might have different
equations for the development of habit strength, with the specific
value being larger in magnitude than -.0305 for fast learners and smaller
for slow learners. Moreover, when Hull came to derive theorems
from his postulates, he often used different values in his equations
from the ones given in the postulates. For all of these reasons,
it seems clear that these values were intended to be illustrative
rather than literally correct. What Hull attempted to present in his
books was the general outline of a rigorous theory together with
some suggestions as to how the outline could be developed into a
full-scale, quantitatively exact system. He was still a long way 
from actually having such a system when he died. 

Even if we leave aside the matter of precise, quantitative values, 
the system does not live up to the ideals that Hull set for it. In a 
system such as Hull's, it should be possible, given the values of 
the independent variables, to compute the values of the interven- 
ing variables and from these the dependent variables. Even if the 
precise numbers needed to make the computations are not avail- 
able, it should be possible to explain how the computations would 
be made. In many of Hull’s cases, however, it is not possible, be- 
cause a given topic is either covered by two or more contradictory 
postulates or is not covered at all. An example of the latter kind of 
problem is the definition of incentive motivation (K) when the 
drive is a painful stimulus, such as electric shock. Presumably in 
that case K would be determined by the amount of pain reduc- 
tion that a response produced, perhaps specified by the difference 
in volts between the shock received before the response and the 
smaller shock received afterward. However, since K is defined 
by the weight of food or other nutriment consumed, it is impossi- 
ble to apply Hull’s equations to the situation where painful stimula- 
tion is the drive. 


How Do We Predict Extinction? 


Let us consider in detail a case where several postulates can ap- 
ply to the same situation and lead to conflicting predictions. Sup- 
pose we ask how many nonreinforced responses will occur before 
complete extinction, when sEr is at a given level at the end of ac-
quisition. There are three ways that we might do this. First,
we might use Postulate 16, which relates excitatory potential to
the number of nonreinforced trials necessary to produce complete
extinction. To use this postulate, we would compute habit
strength, drive, and incentive motivation at the end of acquisi-
tion and multiply them together to get the excitatory potential for
the response at the beginning of extinction. We would then use the
equation in Postulate 16 to compute how many trials to extinction
would be produced by that level of sEr. This would be the most
straightforward approach.

However, since Hull used reactive and conditioned inhibition
as explanations of extinction, we might instead use these inter-
vening variables to compute the number of trials to extinction.
Knowing the general outline of Hull's system, we would expect
to be able to compute the amounts of Ir and sIr built up by any
given number of nonreinforced trials requiring a given amount of 
effort. However, when we consulted Postulate 9, dealing with re- 
active inhibition, we would soon find that it is not stated in such 
a way as to make these computations possible. There is one equa- 
tion relating amount of reactive inhibition to number of nonrein- 
forced trials, and another equation relating amount of work 
to number of nonreinforced trials before complete extinction. 
Neither equation takes account of all the information we ought 
to have if we are to make full use of the intervening variables in the 
theory. We might use either of these incomplete equations to com- 
pute resistance to extinction, and in either case we would get a dif- 
ferent answer from the one we obtained using Postulate 16. 

Still a third alternative would be to use the postulate (unnum- 
bered, since in his haste Hull left it out of the postulate set and had 
to add it later in a footnote) that discusses changes in incentive 
motivation (K). With no reward, K would rapidly approach zero. 
Since sEr = sHr × D × K, sEr would also rapidly approach zero.
Within a few trials, sEr would be so small as to be below the 
threshold, and no response would occur. This approach thus gives 
an answer even more different from the other two than they were 
from each other. Whereas both other approaches at least lead to 
the prediction that extinction would be gradual, this approach pre- 
dicts very rapid extinction. We are thus more than ever in doubt 
as to what Hull’s theory really has to say about how long it will 
take to extinguish a given response. 
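
The contrast between gradual and rapid extinction can be made concrete with a rough sketch. The code below does not reproduce any of Hull's postulates. It simply compares two stylized routes to extinction: one in which K shrinks by a fixed fraction on every nonrewarded trial, so that sEr = sHr × D × K collapses quickly, and one in which a fixed amount of inhibition is subtracted from sEr on every nonreinforced trial. The threshold, the decay fraction, and the inhibition increment are hypothetical values chosen only to show how far apart the two answers can be.

```python
def trials_to_extinction_by_incentive(habit, drive, incentive,
                                      threshold=0.05, k_decay=0.5):
    """Count nonrewarded trials until sHr * D * K falls to the threshold.

    K loses a fixed fraction (k_decay, hypothetical) on every trial.
    """
    if not 0.0 < k_decay <= 1.0 or threshold <= 0.0:
        raise ValueError("need 0 < k_decay <= 1 and threshold > 0")
    trials = 0
    while habit * drive * incentive > threshold:
        incentive *= (1.0 - k_decay)
        trials += 1
    return trials


def trials_to_extinction_by_inhibition(habit, drive, incentive,
                                       threshold=0.05,
                                       inhibition_per_trial=0.02):
    """Count nonreinforced trials until sEr minus inhibition reaches the threshold.

    Inhibition grows by a fixed step (hypothetical) on every trial.
    """
    if inhibition_per_trial <= 0.0:
        raise ValueError("inhibition_per_trial must be positive")
    ser = habit * drive * incentive
    inhibition = 0.0
    trials = 0
    while ser - inhibition > threshold:
        inhibition += inhibition_per_trial
        trials += 1
    return trials


if __name__ == "__main__":
    start = dict(habit=0.9, drive=1.0, incentive=0.9)  # hypothetical end of acquisition
    print("via K:         ", trials_to_extinction_by_incentive(**start))
    print("via inhibition:", trials_to_extinction_by_inhibition(**start))
```

With these illustrative numbers the route through K reaches the threshold in a handful of trials, while the route through inhibition takes several times as many. The particular counts mean nothing in themselves; the point is that the same starting sEr yields very different predictions depending on which part of the system is consulted.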

The mere fact that these three ways of answering the question 
would give three quantitatively different answers is not in itself a 
crucial problem, since we have seen that the particular numerical 
values Hull used were mainly illustrative. However, the lack of in- 
ternal consistency which this illustration points up is a serious 
weakness in the system, probably more so than any other 
flaw. When a theory makes incorrect predictions, it can be modi-
fied, as Hull intended that his theory should be. When a theory
does not deal with a given issue at all, we can accept this limitation
in its scope and hope that some day it may be expanded to include 
the neglected topic. However, when a theory is internally incon- 
sistent, so that it makes conflicting predictions about a given is- 
sue, its worth as a rigorous theory is seriously compromised. To 
some extent we can blame Hull’s failure in this regard on the re- 
visions in his theory. For example, when the intervening variable 
K was added to the system in a later revision, this change called 
for a number of adjustments in other postulates, but some of these 
were never made. Racing against time in the last years of his life, 
Hull considered it more important to show what could be done 
with parts of his system than to make the system as a whole com- 
plete and consistent. Nevertheless, we must recognize that the the- 
oretical system Hull actually created falls far short of the stand- 
ards he himself set for theories. 


Hull's Place in History 


By the irony of history, it appears that Hull failed where 
he most wished to succeed and succeeded in the respect that in- 
terested him least. He wanted both to build a deductive theory, at 
once broad in scope and rigorous in detail, and to encourage oth-
ers to carry on the same sort of theoretical work. We have seen
how far he fell short of his first aspiration. Moreover, his failure 
has probably discouraged others from attempting such a task. 
Many observers have noted a trend in learning theory away from 
the sort of all-encompassing theories that we have been discussing 
so far, toward theories of smaller scope designed to explain only 
certain kinds of learning. This trend may well reflect in part Hull’s 
failure. For many years, psychologists have dreamed of doing for 
their field what Newton did for physics—developing a theory that 
would be at once vast in its scope, precise in its applications, and 
elegant in its simplicity, a theory that would pull together the loose 
ends of psychology into one master system. In pushing so far to- 
ward this goal and yet falling so far short of it, Hull convinced 
many psychologists that the day of such a master theory in psy- 
chology is still far off. 

Yet in his failure, Hull also achieved success beyond that of any 
other recent learning theorist. His terms, his interests, and his ways
of formulating psychological questions have become more wide-
spread than those of any other theorist. More experiments have
probably been inspired by his work than by anyone else’s in the 
past thirty years. His interpretations of drive, reinforcement, ex- 
tinction, and generalization are standard starting points for dis- 
cussion of these topics. He has been attacked, defended, and elab- 
orated until to many people “learning theory” and “Hullian the- 
ory” have become synonyms. The system of terms that he intro- 
duced has been aptly referred to by Cotton as “the Esperanto of 
psychology.” (Only in the last few years has Skinner come to rival 
Hull in this respect.) His outstanding position in the psychology of 
learning, as well as the complexity of his system, justifies devoting 
this chapter to him alone. In addition, there are various other 
interpretations of learning that owe much of their inspiration to 
Hull. We have already considered one of these, Miller’s, and we 
will see more in the next chapter. 


CONNECTIONIST MOVES 
IN COGNITIVE 
DIRECTIONS 


SO FAR we have devoted considerably more space to connectionist
than to cognitive theories, for connectionists have always occupied 
the mainstream of learning theory. To a great extent cognitive 
theorists of learning have worked on the periphery, criticizing the 
formulations of the connectionists rather than presenting equally 
influential systems of their own. However, this does not mean that 
the cognitive theorists have had little influence. Rather, their 
influence has been largely through their effect on the directions of 
connectionist theory. By pointing to problems that the connec- 
tionists tended in their enthusiasm to overlook, the cognitivists have 
often helped to direct the mainstream into more cognitive channels. 
In this chapter we will look at some current interpretations that 
attempt to gain for connectionist theory some of the flexibility and 
power that cognitive theory has traditionally had in dealing with 
complex phenomena. 


SPENCE’S SYSTEM 


Almost by definition, a system such as Hull’s is never 
completely finished. As new laws are established by new experi- 
ments, the postulates are almost certain now and then to require 
some modification. Hull’s system at his death was in a particularly 
unfinished state, however, since changes which Hull made in the 
last few years of his life had not yet been carried through to their 
logical conclusions. The system thus suffered, as we have already 
seen, from internal inconsistencies. The job of straightening out 
the inconsistencies and reaching the logical conclusions was un-
dertaken by Kenneth W. Spence (1907-1967). Spence was a prize
student of Hull’s at Yale and afterwards spent most of his profes- 
sional career at the State University of Iowa, though the last few 
years of his life were spent at the University of Texas. For two 
decades before Hull’s death, Spence was a major contributor to his 
thinking and a great producer of research relevant to the theory. 
When Hull died, Spence continued the tradition in which he had 
long been a major figure. His recent contributions to that tradition 
are presented in his book Behavior Theory and Conditioning, 
originally presented as a series of lectures at Yale. 

In spite of their close connection, Hull and Spence differ some- 
what in their approach to theory. Though Spence is a great expo- 
nent of theory construction, he is more cautious about it than Hull. 
His theory, though it can readily be seen as a revision of Hull’s, 
is less elaborate, less formally stated, and more obviously ten- 
tative than Hull’s. Even though Hull expected to keep modifying 
his theory in the light of new evidence, he stated it in the form of a 
formal, finished structure. Spence states his more casually, leav- 
ing more of the details to be settled by experiment. In particular, 
Spence regards Hull’s inclusion of specific numerical equations 
in his postulates as highly premature. With our present informa- 
tion, or lack of it, he believes that such specific equations can only 
represent misleading pseudo precision. Spence is also more con- 
cerned than was Hull about the boundary conditions of a the- 
ory. These are the limits within which a theory is expected to op- 
erate. For example, a theory of education that worked well for 
explaining and predicting what happens in middle-class Ameri- 
can schoolrooms might fail completely if applied to education 
among the Bushmen, or perhaps even to education in American 
urban ghettos, because education in these other situations is a 
topic outside the boundary conditions of the theory. Similarly, a 
theory intended to deal with learning by animals or human babies, 
where language does not enter in, might not be appropriate for 
learning by school children or adults. Therefore a theory should 
specify the conditions to which it is and is not intended to apply. 
Thus in various respects Spence does not set his aspirations for 
theory quite as high as Hull, but he is nonetheless a major advocate 
of theory construction as vital to the development of science.

The chief innovations in Spence’s system as compared with
Hull’s are those concerned with K, the incentive motivation fac- 
tor. At Hull’s death this had still not been fully assimilated into 
his system. Spence undertook to clarify the status of K, and in do- 
ing so made radical changes in the whole basis of the theory. 


The Relation of K to rg


Hull intended to define all of his intervening variables in terms 
of the independent variables that produced them and the depend- 
ent variables that they influenced. Nevertheless, he was quite will- 
ing to speculate about what physiological changes inside the or- 
ganism might be involved. Habit strength might be related to 
some change in the neural connections between the sense organ that 
receives a stimulus and the muscle or gland that makes a response. 
Drive might be related to a bodily need (as Hull himself sug- 
gested) or to a strong stimulus (as Hull suggested and Miller em- 
phasized). But what of incentive motivation? Is there any bodily 
process to which this construct can be related? Yes, says Spence, 
there is the fractional antedating goal reaction. Hull never clari-
fied the relationship between K and rg, but it seemed to Spence
that such a relationship existed and should be made explicit. The
rg is assumed to be a response, but one differing from other
responses in that the sensations it produces are reinforcing. If an in-
dividual has eaten a large quantity of food several times in a cer-
tain place, the rg of eating will be strongly conditioned to the stim-
uli of that place. Whenever he observes these stimuli, rg will oc-
cur and produce sg. If he had eaten only a little food on previous
occasions in that place, the rg and sg would be only weakly
evoked. If he had never eaten there at all, they would not be evoked
at all. Note, however, that the same condition (amount of food)
which determines the strength of rg also determines the magni-
tude of K. Let us say, then, suggests Spence, that K is nothing more
than the total strength of rg and the associated sg. By combin-
ing into one these two concepts, K and rg, we have simplified the
theory and eliminated some of the uncertainty as to just where K
fits.

This change, however, has implications beyond a mere simpli- 
fication. For although K and rg in Hull’s system have so much in 
common, they differ in one important respect. For Hull, K is related
to a stimulus-response connection: the greater the food reward
for making a given response to a given stimulus, the greater
the K for that stimulus-response connection. On the other hand,
rg is related only to a stimulus: the more the eating that has occurred
in the presence of a stimulus, the greater the tendency to
make rg in response to that stimulus. By treating K as nothing
more than rg, Spence has related K purely to the stimulus. Thus
the incentive component in the equation for excitatory potential
is not the incentive for making a response but the incentive for getting
to certain stimuli. If we should refer to these stimuli that evoke
rg as “signs that food is near,” the similarity of this theory to Tolman’s
would be emphasized.

A closely related innovation in Spence’s system is concerned 
with the building up of habit strength. Hull, in the final version of 
his theory, made habit strength a function of the total number of 
times that the response had occurred in the presence of the stimu- 
lus and been reinforced. The size of the reinforcement, however, 
made no difference. Is it reasonable, we may ask, to assume that 
there is no difference in this respect between a huge reward and 
a tiny one, but a very important difference between a tiny one and 
none at all? Spence solves this problem by assuming that H (he 
omits the subscripts) does not depend on reinforcement. When- 
ever a response is made in the presence of certain stimuli, he says, 
H increases. Thus H depends only on practice, not on reinforcement.
Reward operates only through K (that is, rg), not through
H.

With these changes from Hull, what does Spence’s system look 
like? As with Hull’s, Spence’s H, D and K combine to produce
E. For reasons that need not concern us here, D and K are added
together instead of multiplied, so that the equation becomes
E = H(D + K). As with Hull, various other components enter
into the full equation, but this abbreviated form will be adequate 
for our purposes. In this formula, H depends only on the total 
number of times that the response has been made in the presence 
of the stimuli, while K depends on the extent to which goal re- 
sponses have occurred previously in the presence of the stimuli. 
Thus drive reduction does not enter into the equation at all, and 
reinforcement enters only indirectly through rg. 
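
The difference between multiplying and adding D and K can be made
concrete with a small numerical sketch. The multiplicative form used
for Hull below, E = H x D x K, is an assumption inferred from the
statement that Spence adds D and K "instead of" multiplying them. The
function names and the numerical values are hypothetical, chosen only
for illustration, and the other components of the full equations are
omitted.

```python
def hull_potential(h, d, k):
    """Multiplicative combination assumed here for Hull: E = H * D * K."""
    return h * d * k


def spence_potential(h, d, k):
    """Spence's abbreviated equation from the text: E = H(D + K)."""
    return h * (d + k)


if __name__ == "__main__":
    h, d = 0.5, 0.5  # hypothetical habit strength and drive
    for k in (0.0, 0.25, 0.5):  # hypothetical incentive values
        print(f"K={k:.2f}  "
              f"Hull E={hull_potential(h, d, k):.3f}  "
              f"Spence E={spence_potential(h, d, k):.3f}")
```

Under these assumptions, when K is zero the multiplicative rule yields
no excitatory potential at all, while the additive rule still yields
H times D. That is the practical consequence, in this sketch, of adding
the two terms rather than multiplying them.
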

As a final complication, it must be pointed out that the above
applies only to the learning of responses that lead to rewards 
which can be consumed. Where a response is rewarded by escape 
from a noxious stimulus (or is negatively reinforced, in Skinner’s 
terms) different laws apply. As in Hull’s theory, these have not 
been worked out as fully as those for positive reinforcement, but 
one point stands out: drive reduction is more important in the 
learning of these responses. The experiments by which Spence 
supports this interpretation are ingenious, but it is not necessary 
to report them here. 

It is hoped that the astute reader will have noted the many re- 
semblances between Spence’s theory and those of a variety of 
other theorists we have discussed. Like Skinner’s, Spence’s sys- 
tem is in a practical sense a stimulus-response reinforcement the- 
ory. However, Spence attempts to analyze the nature of reinforce- 
ment, which Skinner does not, and in the process sounds much 
like Guthrie. Both H and K depend only on practice: in one case 
practice in making the instrumental response, in the other case 
practice in making rg. In that sense, Spence’s theory is as much a 
pure contiguity theory as Guthrie’s. Moreover, rg is an example
of movement-produced stimuli contributing to the maintenance 
of goal direction, a thoroughly Guthrian notion. Finally, and most 
interestingly, Spence has a marked resemblance to the cognitive 
theorists, particularly Tolman. Though in his earlier years Spence 
was an outspoken critic of cognitive interpretations, his own work 
has led him over the years to a theory that lends itself well to cog- 
nitive purposes. 

Spence’s theory deals with phenomena of interest to cognitive 
psychologists in much the same way that Hull does, but his theory
is even better adapted to doing so than Hull’s, because Spence has
eliminated reinforcement as a fundamental concept (at least so 
far as positive rewards are concerned). In place of reinforcement, 
he has the rg mechanism, which can easily be regarded as the an- 
ticipation of reward. This reasoning completes the separation, 
begun by Hull, between learning about what paths lead where and 
learning about where reward is located. In accepting this distinc- 
tion, which is central to Lewin and Tolman but absent in Skinner 
and only hinted at in Guthrie, Spence made a large stride toward 
a more cognitive interpretation of learning. Though often a critic 
of others’ formulations, Spence has included elements of more
other systems in his own system than has any other theorist of learn- 
ing except perhaps Tolman. 


MOWRER’S INTERPRETATIONS OF LEARNING 


For all his similarities to cognitive theory, Spence sticks firmly 
to his connectionist terms and to his insistence that cognitive be- 
havior is basically a matter of responses and stimuli. Another well- 
known theorist, who also began his work somewhat in the Hull- 
ian tradition, has moved toward cognitive interpretations in his 
terms as well as in his ideas. This is O. Hobart Mowrer (b. 1907), 
currently a professor at the University of Illinois. Originally 
noted, like Miller, for his applications of learning theory to per- 
sonality, Mowrer has recently been best known for his concern 
with the place of sin in psychopathology. In addition, however, 
Mowrer has attained note as a theorist of learning and of thinking. 
Beginning with attempts to resolve certain problems within con- 
nectionist theory, he has become more and more a cognitive 
theorist as he has proceeded. 


Mowrer’s Older Two-Factor Theory 


The problem which Mowrer originally tried to solve was con- 
cerned with the nature of reinforcement. Is it, or is it not, a matter 
of drive reduction? Mowrer agreed with Hull and Miller that the 
answer is usually yes. The events that are reinforcing are the ones 
that relieve hunger or pain or fear or some other noxious state. 
However, Mowrer was particularly concerned with the learning of 
emotional reactions, and here the situation seemed to be differ- 
ent. When Miller’s rats learned to be afraid of the white compart- 
ment, or when a burnt child learns to fear the fire, what is the re- 
inforcement for this learning? A common-sense answer would be 
that the pain, of electric shock or of a burn, is the reinforcer. Hull 
and Miller did not accept this common-sense interpretation, but 
maintained instead that it was a reduction in pain immediately 
afterward that served as the reinforcer. We have already seen 
how Hull and Miller tried, by this interpretation, to maintain a 
consistent drive-reduction explanation of reinforcement. To 
Mowrer, however, the common-sense explanation seemed preferable.
It seemed unreasonable to him that the emotional responses a rat
or a person makes when hurt are reinforced by the
termination of the pain. If that were true, brief punishment would 
reinforce any response, while lengthy punishment would have no 
effect on it. Everyday experience contradicts this idea, and Mowrer 
did a number of experiments that further contradicted it. These 
results led Mowrer to formulate a two-factor theory of reinforce- 
ment. 

Mowrer’s two-factor theory was much like Skinner’s distinc- 
tion between the conditioning of reflexes and the learning of oper- 
ant behavior. Several people prior to Mowrer had, in fact, suggested 
that classical and instrumental conditioning are basically differ- 
ent kinds of learning. It remained for Mowrer, however, to sharpen 
the distinction and to emphasize its importance for the under- 
standing of all learning situations. 

Mowrer referred to his two forms of learning as sign learning 
and solution learning. The two are sharply demarcated in various 
ways. Sign learning involves classical conditioning of involuntary 
responses of the smooth muscles in the internal organs and of the 
glands. These are what we call emotional responses, and the most 
important example is fear. An unconditioned stimulus, such as 
pain, elicits these involuntary fear responses. Any stimulus that is 
paired with the unconditioned stimulus can then become a con- 
ditioned stimulus for the fear responses. Thus the unconditioned 
stimulus that elicits the emotional response is also the reinforcer in 
the conditioning of fear to new stimuli. This form of learning 
is called sign learning because the conditioned stimulus becomes 
a sign of danger and thus elicits an emotional response. 

Solution learning is the instrumental conditioning of re- 
sponses that reduce drives. It involves voluntary responses of the 
striped, skeletal muscles. The reinforcement for this form of 
learning is the reduction of the drive. It is called solution learn- 
ing because the response solves the problem posed for the individ- 
ual by the drive. 

So far Mowrer seems to be adding very little to what Skinner 
had already said. He has substituted the terms sign learning for 
the conditioning of respondent behavior and solution learning for 
the learning of operant behavior. Also, he has restricted sign 
learning to the emotional responses of smooth muscles and glands. 
The importance of his contribution is in suggesting the relation- 
ship between these two kinds of learning. In a great many cases 
the drive that is reduced in solution learning is a secondary drive
that was previously acquired by sign learning. The learning proc- 
ess thus has two stages. In the first, fear (or possibly some other 
emotional response) is conditioned to some stimulus by a process 
of sign learning. This fear response produces stimuli that act as a 
drive. In the second stage, an instrumental response that reduces 
the fear drive is learned by the process of solution learning. The 
instrumental response reduces the fear by getting the individual 
away from the conditioned stimulus that arouses the fear. 

Consider as an example a child who is bitten by a dog. Since 
the unconditioned stimulus of pain from the bite produces emo- 
tional responses characteristic of fear, the sight of the dog becomes 
a conditioned stimulus for fear. This event is sign learning. On a 
later occasion, the sight of the dog (or perhaps, through generali- 
zation, the sight of any dog) produces fear, a secondary drive. If 
the child now runs away, thus escaping from the fear-producing 
sight of the dog, this running response will be reinforced by fear 
reduction. This event is solution learning. Both stages of the 
learning process must be considered in order to understand the 
child’s reaction to the dog. 

In principle, either stage of the process can occur without the 
other. In practice, however, it is doubtful if they ever occur inde- 
pendently. When a drive is conditioned (stage 1), there is usually 
some response available that will reduce it at least slightly (stage 
2). Moreover, primary drives for which one learns an instrumen- 
tal response (stage 2) are likely to have some component of sec- 
ondary drive resulting from previous learning (stage 1). For ex- 
ample, when a person in the dentist’s chair feels a slight pain, 
the resulting drive is due only partly to the pain itself, and partly 
to fear. The fear in turn results from the fact that in the past slight 
pain has sometimes been followed by greater pain and thus be- 
come a conditioned stimulus for fear appropriate to the greater 
pain. Thus pain combines with fear of pain to produce the total 
drive. A reassurance, such as the dentist’s saying, “I’m almost 
done,” can reduce the fear and hence reduce the total drive, even 
though it cannot reduce the actual pain. 

This two-stage analysis of learning by Mowrer sounds very 
much like that presented by Dollard and Miller. However, by treating
the acquisition of fear as sign learning, Mowrer avoids Dollard
and Miller’s problem of finding a drive-reducing reinforcer
for the acquisition of the fear response. For Mowrer, it is the on- 
set of the drive, serving as an unconditioned stimulus, that re- 
inforces the fear response in stage 1, whereas Miller and Dollard 
would say that the end of the drive reinforces the fear response. 
Mowrer’s view is more consistent both with common sense and 
with experimental evidence. On the other hand, such a sharp dis- 
tinction between two kinds of learning seems artificial to many 
people, especially in view of the fact that such principles as extinc- 
tion, generalization, and discrimination apply to both. However, 
as the reader can see by comparing the discussion of Miller and 
Dollard with the discussion of Mowrer, in most situations it makes 
little difference which assumption one makes. Although our under- 
standing of the learning process will be very incomplete until this 
issue (along with many others) is resolved, many practical appli- 
cations are possible without the answer to this theoretical question. 


The Newer Sign-Learning Theory 

Since the original presentation of his two-factor theory in 1947, 
Mowrer has modified it by pointing out that solution learning 
has many of the characteristics of sign learning. Suppose an in- 
dividual learns not to do something for which he has been punished. 
According to two-factor theory, in stage 1 of this learning the
sensations produced by his own behavior became a conditioned
stimulus for fear. In stage 2, the response of changing his behavior
away from that which was punished was reinforced by reduction 
of the fear, and learned. In stage 1, sensations from his own behav- 
ior became danger signs. Why, asks the new Mowrer, should we 
not say the same thing about stage 2: that sensations from the in- 
dividual’s own behavior became signs of hope? When the fearful 
person felt himself making the new, never punished response, he 
felt safer, because the sensations from this response had previously 
been followed by a reduction in fear. This new interpretation of 
Mowrer’s replaces his older two-factor theory with a new, cog- 
nitive-sounding one-factor theory. All learning, in this new ver- 
sion, is sign learning. 

Through learning, according to this theory, stimuli become 
signs of fear or hope or disappointment or relief. Even the acquisition
of bodily skills can be interpreted as sign learning. When we
learn certain movements in riding a bicycle, for example, we are 
learning that certain sensations from our bodies mean we are doing 
it right, while other sensations indicate that we are about to tip 
over and must act quickly to keep our balance. The important thing 
about these stimuli is not that they elicit certain responses but that 
they have certain meanings, that they are signs of success or of 
impending failure in our attempt to ride. The history of Mowrer’s 
thinking, in coming to this position, is covered in his 1960 book, 
Learning Theory and Behavior. 

This new interpretation of learning makes Mowrer more cogni- 
tive than any of the other theorists we have discussed who devel- 
oped their systems within the connectionist tradition. It is there- 
fore interesting to compare him with Tolman, the cognitive theo- 
rist who is closest to the connectionists in his interests and objectives. 
The two are alike in one key respect: both regard all (or at least 
most) learning as sign learning. They are also alike in being more 
concerned than most cognitive theorists with the relationships be- 
tween independent and dependent variables in the external 
world. In some respects, however, they differ. For one thing, Tol- 
man goes farther than Mowrer in emphasizing cognitive pattern- 
ing. Mowrer talks about sign learning but not, like Tolman, about 
sign-gestalt learning. Neither does Mowrer discuss the complex 
organizations of cognitions that Tolman calls cognitive maps. 
Mowrer’s illustrations tend to be simple ones, with little concern 
about the organization of the life space. For another thing, 
Mowrer is still largely a reinforcement theorist. He is concerned 
with signs mainly as they point to increases or decreases in drive. 
The signs he talks about are signs of hope or fear or disappoint- 
ment, not signs that path A leads to point B. For both of these rea- 
sons, Tolman’s concept of the field expectancy has no real counter- 
part in Mowrer’s system. Instead, Mowrer is concerned with a fire 
alarm as a sign of danger, or with the jingle of money in one’s pocket 
as a sign of hope, or the like, not with the signs that provide our 
complex cognitive orientations. Thus Mowrer is less cognitive than 
Tolman, but the difference seems to be more one of emphasis than 
of real disagreement. 


MEDIATING RESPONSES


So far in this chapter we have seen two examples of connectionist 
theory being broadened to incorporate aspects of the cognitive ap- 
proach, each primarily the work of one theorist. We turn next to a 
general approach that has taken many forms and been advanced 
by many people, without any one person emerging as its main
exponent. A number of the major theorists we have discussed in 
this book have made use of this approach, as have many other 
theorists we have not mentioned. This approach is the use of what 
are known as mediating responses. 

We have seen that Hull and Spence made great use in their 
theories of fractional antedating goal responses and the stimuli they 
produced. Guthrie made somewhat similar use of movement- 
produced stimuli, while Miller’s interpretation of secondary drive 
involved strong stimuli produced by the individual’s own emotional 
responses. In all of these cases, the primary effect of a response 
was to produce the stimuli for further responses, thereby producing 
a mediating link between the original external stimulus and the 
final observable response. Such responses are therefore known as 
mediating responses. 


Mediating responses and the stimuli they produce serve a variety 
of functions in theories of learning and motivation. First, there are 
drive functions. For example, conditioned emotional responses 
produce drive stimuli and thus serve as the basis of secondary 
drive. This function is emphasized by Dollard and Miller and by 
Mowrer. Second, there are incentive functions. For example, 
fractional antedating goal reactions produce stimuli that have 
secondary reinforcing properties. These serve as the basis of the 
incentive component in Spence’s system. The anticipatory muscle 
readinesses that Guthrie discusses are similar. Third, there are cue 
functions. Here the stimuli produced by the mediating response serve 
as the guiding stimuli for further behavior. A striking example is 
Watson’s view that thinking is merely subvocal speech—a series 
of fractional speech movements that serve the function of talking to 
oneself. It is this cue function of mediating responses with which 
we will be mainly concerned here. 

In some cases it is fairly clear that mediating responses serve 
this cue function by changing the orientation of our sense receptors 
and hence of the stimuli we receive. A person staring at something
and a dog pricking up his ears are obvious examples. In such cases 
we can speak of the mediating responses as observing responses 
(Wyckoff, 1952). In other cases we cannot identify any specific 
observing response, but it is still plausible to say that the mediating 
response changes the effective sensory input in subtler ways, 
through a change in attention. Often it is difficult to say whether 
we are dealing with changes in receptor orientation or in attention 
at the level of the central nervous system. If it is true, as someone 
has said, that a man notices a woman’s legs, another woman 
notices her hat, and a thief notices her purse, is the difference in 
what they look at (observing responses) or in the part of what they 
look at that they notice (attention)? Perhaps it depends on some- 
thing as trivial as how far away the woman is, and hence whether 
it is possible to focus one’s eyes on all of her at once. So, although 
a full understanding of mediation would require knowing just what 
form the mediating responses take, for our present purposes we 
can group together all those cases where mediating responses of 
whatever form serve primarily a cue function. 


Cue-Producing Responses 


One phenomenon that can be explained by cue-producing 
mediating responses is the acquired distinctiveness of cues. This 
term refers to learning about which cues in a situation are im- 
portant. When a first-grader discovers that it is the shape of letters 
rather than their size that is important in reading, the cue of form 
has acquired distinctiveness for him. This acquisition will then 
affect his subsequent learning of letters and words. The response of 
paying attention to form is separate from any particular response 
that he makes to any particular form. This separateness has been 
demonstrated by D. H. Lawrence (no relation to the novelist) in 
several ingenious experiments with rats (Lawrence, 1949, 1950). 

Another closely related example of a situation where observing 
responses are important is in the formation of learning sets. These 
were first demonstrated by Harry F. Harlow of the University of 
Wisconsin (Harlow, 1959). To demonstrate a learning set, an indi- 
vidual is given a long series of discrimination problems. On each 
problem he is presented with two objects. One of these has a reward 
behind it on every trial, the other not. The same one is correct on 
each trial. He has to find out by trial and error which one is correct,
but once he has learned this he can be right on every subsequent 
trial of that problem. If he understands the principle, he can be cor- 
rect on all but the first trial of every problem. All he has to do is 
to choose either stimulus at random on the first trial. If it is correct, 
he chooses the same one on all the other trials. If it is wrong, he 
chooses the other one on all the remaining trials. For the com- 
pletely inexperienced learner, this solution is not as easy as it 
sounds. For one thing, the right-left position of the stimuli is 
changed from trial to trial, and some learners tend to respond to 
the position rather than to the object. There is also a widespread 
tendency to choose the other one on trial 2 regardless of whether 
the first one was rewarded on trial 1. Because of these and other 
error factors, inexperienced learners are likely to do poorly on the 
task. How are they to know that the same object, regardless of po- 
sition, will be rewarded on every trial? Gradually, however, they do 
learn this principle, so that learning gets better and better from 
problem to problem. After having experience with enough different 
problems, all of which can be solved in the same way, learners 
commonly reach the point where their scores on new problems 
are nearly perfect. They are then said to have formed a learning set. 
A learning set is an example of learning how to learn. It involves 
being set to approach the problem in a certain way. Since the approach
to the problem requires paying attention to certain cues
rather than others (form rather than position, for example), ac- 
quired distinctiveness of cues is one aspect of the formation of 
learning sets. In Tolman’s terms, we are dealing with a field cogni- 
tion mode. Most of the studies of learning sets have been with ani- 
mals, particularly monkeys. There are large species differences 
and age differences in the rate at which learning sets are formed, 
more so than in the learning of a single discrimination. The rate at 
which learning sets are formed has come to be regarded as one of 
the best measures of differences in intelligence among animals. 
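
The principle that the experienced learner follows on each problem can
be written out as a short procedure: guess on the first trial, keep the
chosen object if it was rewarded, and switch to the other object if it
was not. The sketch below simulates one problem under that rule. It is
an illustration only; the function name, the object labels "A" and "B",
and the number of trials are hypothetical, and the left-right position
of the objects is deliberately left out because the rule ignores it.

```python
import random


def run_problem(correct, n_trials=6, rng=random):
    """Simulate one two-object discrimination problem.

    The learner guesses on trial 1, then stays with a rewarded
    object and shifts away from an unrewarded one. Returns the
    number of rewarded trials.
    """
    objects = ("A", "B")
    choice = rng.choice(objects)  # trial 1: choose at random
    n_correct = 0
    for _ in range(n_trials):
        rewarded = (choice == correct)
        n_correct += int(rewarded)
        if not rewarded:
            # Unrewarded: choose the other object from now on.
            choice = objects[1 - objects.index(choice)]
    return n_correct


if __name__ == "__main__":
    rng = random.Random(0)
    scores = [run_problem(rng.choice("AB"), rng=rng) for _ in range(10)]
    print(scores)  # every score is n_trials or n_trials - 1
```

Because the same object is rewarded on every trial of a problem, a
learner following this rule can miss at most the first trial, which is
the nearly perfect performance described above.
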

Much of education may be regarded as the formation of learning
sets. We hope that in teaching students we are not merely providing
them with specific stimulus-response connections, but are giving
them the necessary background for continued learning after
they leave school. We want them to be able, not merely to parrot 
answers to questions, but to solve problems by making use of what
they have learned. Studies of learning sets show individuals going 
through such a process of learning how to solve problems. The 
solution to one problem cannot be transferred directly to another 
problem, but the general method of solution can. Just as the “intel- 
ligent” behavior of the experienced monkey solving a discrimi- 
nation problem reflects his prior experience in forming a learning 
set, so the “intelligent” behavior of an adult human reflects his 
prior experience in learning how to solve problems. 

An especially important aspect of these cue-producing responses 
is the way that they change with age, becoming more complex and 
more important as determinants of behavior. An increase with age 
in the use of such mediating responses has been used by the 
husband-wife team of Howard and Tracy Kendler (1962) to ex- 
plain a difference in discrimination learning between adults and 
older children on the one hand and animals and young children 
on the other. Consider a set of stimuli that differ both in 
size and in color. Suppose a person is reinforced whenever he 
chooses the black one but never when he chooses the white one, 
regardless of size. In that case the variable of color is relevant to 
the discrimination but the variable of size is irrelevant. Even quite 
small children, as well as rats, can learn such a discrimination. 
Then suppose that the reinforcement contingencies are changed in 
either of two ways. In one case a choice of white is now always 
reinforced and a choice of black never, still irrespective of size. 
Since the significance of the color cues has been reversed, this is 
called a reversal shift. In the other case a choice of the large stimu- 
lus is always reinforced and the small never, regardless now of 
color. This change is called (not surprisingly) a nonreversal shift. 
(If it were always the small that was now reinforced, it would of 
course also be a nonreversal shift.) Either shift constitutes a new 
learning problem. Which will be harder to learn? As we have 
already hinted, for the human subject, the answer depends on the 
age, whereas rats behave like the younger children. The reader 
may want to stop at this point and guess which kind of shift is 
easier for the older subjects and which for the younger. 

It turns out that reversal shifts are easier for adults and older 
children, while nonreversal shifts are easier for young children 
and rats. The Kendlers argue that this difference occurs because 
the older subjects make the mediating response of paying attention
to the relevant dimension and ignoring the irrelevant, while younger 
subjects do not. For the older subjects, accordingly, a reversal shift 
requires only one change—from choosing black to choosing white. 
The mediating response of paying attention to color is still useful 
and does not need to be changed. The nonreversal shift, however, 
requires two things of the subject: first, to change from noticing 
color to noticing size; then, to learn whether large or small is re- 
inforced. Requiring more steps, the adjustment to the nonreversal 
shift is naturally slower. 

The case of the young children and the rats is a little more
complicated for us to analyze, though for the subjects themselves
it is simpler. In the original discrimination, the subject learns two 
separate stimulus-response connections, without mediation. These 
are: (1) given a large black and a small white, choose the large 
black, and (2) given a small black and a large white, choose the 
small black. In a nonreversal shift to “large correct,” (1) above is 
still correct; only (2) needs to be changed. In a reversal shift, 
however, both are now incorrect and must be changed. Conse- 
quently, for these subjects the reversal shift is the harder of the 
two. 
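
The two analyses can be checked by counting, for each kind of learner,
how much has to change after a shift. The sketch below is only an
illustration of the argument above, not the Kendlers' own procedure.
The names are hypothetical, and the mediated learner is modeled under
the simplifying assumption that a shift costs one step for learning the
newly rewarded value plus one more step if the attended dimension must
also change.

```python
# The two stimulus pairs from the text; each stimulus is (size, color).
PAIRS = [
    (("large", "black"), ("small", "white")),
    (("small", "black"), ("large", "white")),
]


def correct_choice(pair, dimension, value):
    """Return the stimulus in the pair that the rule rewards."""
    index = 0 if dimension == "size" else 1
    for stimulus in pair:
        if stimulus[index] == value:
            return stimulus
    raise ValueError("no stimulus in the pair has that value")


def unmediated_changes(old_rule, new_rule):
    """Count separate stimulus-response connections that must change."""
    return sum(
        correct_choice(pair, *old_rule) != correct_choice(pair, *new_rule)
        for pair in PAIRS
    )


def mediated_steps(old_rule, new_rule):
    """One step to learn the new value, plus one if the dimension changes."""
    return 1 + (1 if old_rule[0] != new_rule[0] else 0)


if __name__ == "__main__":
    original = ("color", "black")
    reversal = ("color", "white")
    nonreversal = ("size", "large")
    for name, rule in (("reversal", reversal), ("nonreversal", nonreversal)):
        print(name,
              "mediated steps:", mediated_steps(original, rule),
              "unmediated changes:", unmediated_changes(original, rule))
```

On this count the mediated learner needs one step for the reversal
shift and two for the nonreversal shift, while the unmediated learner
must change two connections for the reversal shift and only one for the
nonreversal shift, which matches the ordering of difficulty given above.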

The cue function of mediating responses is particularly valuable 
for a rapprochement between connectionist and cognitive theory. 
If our own responses can provide us with cues and if these can 
include complex verbal cues, we are well on the way to a con- 
nectionist cognitive theory. If a man can make the responses of 
drawing a map on paper, can he not also make responses of 
imagining such a map? If he can give himself instructions in words, 
why not in images, in muscular sets, or in any other sort of stimulus- 
producing response? And if such nonverbal mediating responses 
are available to humans, why not to animals also? This line of 
reasoning can be carried to the point where nearly all of the cognitive
theorists’ statements about life space, sign-gestalt expectations,
and cognitive maps are accepted, but all are reinterpreted as
stimuli produced by the individual’s own responses. 

Clearly, not all of the suggestions about mediating responses
that have been made by various connectionist psychologists have
gone this far. Though Hull, Spence, Mowrer, and Guthrie all make 
use of mediating responses, none of them regards the response-produced
stimuli involved as having the extensive gestalt properties
that cognitive theorists favor. Nevertheless, there is room for
such an extreme development of connectionist theory in the cog- 
nitive direction, and a number of such moves have been made. 


Psychologists in the Soviet Union have shown a related interest 
in their discussions of the second signal system. The kinds of con- 
ditioned stimuli that Pavlov studied, which signal the arrival of an 
unconditioned stimulus because they have been paired with that un- 
conditioned stimulus in the past, are referred to collectively as the 
first signal system. Words and other arbitrary symbols also serve 
as signals, but the learning by which they became signals is more 
complex; they are said to make up the second signal system. 

The complexities introduced by the second signal system are 
illustrated in an experiment quoted by Razran (1961). A boy was 
conditioned to salivate to the Russian word for “good” or “well” 
as the conditioned stimulus (S^D), while the word for “bad” or
“badly” was nonreinforced (an S^Δ). He was then tested with a
variety of sentences that did not contain either word but that had 
definite positive or negative connotations, and was found to salivate 
much more to the positive than to the negative ones. He produced 
the most drops of saliva to “The Soviet Army was victorious” and 
the fewest to “The pupil was fresh to the teacher.” In addition to 
what this experiment tells us about conditioning, it gives us an 
interesting look at the value system of the Soviet Union! 


The Study of Meaning 


This consideration of the evaluative significance of verbal stimuli 
can serve as an introduction to another aspect of mediation, the 
meaning of meaning. 

An important development in this area is the work of Charles 
E. Osgood and his associates at the University of Illinois on the 
concept of meaning (Osgood, Suci, and Tannenbaum, 1957). They 
suggest that for every concept a person has, there is a mediating 
response which provides the meaning of that concept for him. 
(Actually they use the more general term “mediating process,” but 
for consistency we will continue to speak of mediating responses.) 
These mediating responses are fractional parts of our response to 
the thing the word names. Thus our response to “sour” might be a 
slight tendency to pucker the lips and salivate, and our response 
to “mountain” might be a slight tendency to look upward. The 
mediating responses may also be part of the emotional response to 
the thing. We might, for example, smile slightly in response to the 
word “happy” or become tense in response to the word “danger.” 

Just what the particular mediating responses are is something 
we usually do not know. As Osgood and his associates are care- 
ful to point out, they may not be responses in the usual sense, like 
those in our examples, but may be unobservable processes in the 
central nervous system. We can, however, provide indirect evi- 
dence to support the view that the mediating response to a word 
is similar to the response to the thing that the word names. This 
view states that both the word and the thing are stimuli for a me- 
diating response that produces further stimuli. These in turn may 
be the stimuli for some overt, measurable response. The mediat- 
ing response and the stimuli it produces are similar whether they 
occur in response to the word or to the thing. This, indeed, is what 
we mean when we say that the word means the thing. Conse- 
quently the final overt response to the word should be similar to 
that to the thing. That it is so has been demonstrated by experi- 
ments in which a bell or a blue light is used as the conditioned stim- 
ulus for some response and then the word “bell” or “blue” is pre- 
sented. A large amount of generalization is found from the stim- 
ulus to its name. 

It seemed unlikely to Osgood and his associates that there are 
as many unique mediating responses for meanings as there are 
words in the English language. Considering the many ways in 
which the meanings of different words are related, they thought 
that a fairly small number of different mediating responses should 
be enough to explain most of the meaning of words. Since they could 
not study the mediating responses directly, they undertook instead to 
study the different mediating responses in terms of the dimensions 
of meaning. In order to make this study, they developed a technique 
known as the semantic differential. 

In the semantic-differential method, one begins with several 
pairs of opposite words. These might be such pairs as good-bad, 
strong-weak, hard-soft, and wet-dry. Each of these pairs defines a 
dimension of meaning running from one extreme to the other, as 
from good to bad or from strong to weak. Any other word can 
then be rated according to where it falls on each of these dimen- 
sions. A given person might rate the word “friend” close to the 
good end of the first dimension, half way between strong and weak 
on the second, and a little closer to soft than to hard on the third. 
This pattern of points on the various dimensions then defines the 
meaning of the word “friend” for this person according to this 
method. With two words, or with two people rating the same word, 
the difference between the patterns indicates how different the 
meanings are. 
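
The comparison of rating patterns can be sketched in a few lines of Python. The seven-point scale, the particular ratings, and the use of straight-line (Euclidean) distance as the measure of difference are all assumptions made here for illustration; the passage above says only that the difference between patterns indicates how different the meanings are.

```python
import math

# Scale names follow the word pairs mentioned in the text.
# Ratings are invented: 1 = first word of the pair, 7 = second word.
SCALES = ("good-bad", "strong-weak", "hard-soft")


def profile_distance(a, b):
    """Euclidean distance between two rating profiles over the same scales."""
    return math.sqrt(sum((a[s] - b[s]) ** 2 for s in SCALES))


# Two hypothetical people rating the word "friend".
friend_person_1 = {"good-bad": 1, "strong-weak": 4, "hard-soft": 5}
friend_person_2 = {"good-bad": 2, "strong-weak": 4, "hard-soft": 3}

d = profile_distance(friend_person_1, friend_person_2)
print(round(d, 3))  # 2.236
```

A distance of zero would mean that the two profiles coincide on every scale; larger values indicate a greater difference in rated meaning.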

Some of the dimensions that are used with this method turn out 
to be closely related, so that any word that is rated high on one is 
likely to be rated high on the other, and vice versa. Good-bad, 
for example, would be highly related to right-wrong. We can then 
conclude that these are really the same dimension. Over a long 
series of studies it is possible to determine how many really sepa- 
rate dimensions are needed to describe the meanings of words. 
It would then be plausible to suppose that each of these separate 
dimensions represented a separate mediating response. These sev- 
eral different mediating responses, occurring at the same time 
with different intensities in response to a stimulus word, would 
determine the meaning of that word. Osgood and his associates 
have found that a considerable part of the meaning of words can 
be accounted for with just one such dimension: the evaluative 
dimension. This includes such pairs as good-bad, beautiful-ugly, 
and kind-cruel, all of which are concerned with the desirability or 
value of the word being rated. Much of the remaining meaning is 
carried by two other major dimensions: strong-weak and active- 
passive. We can tell a great deal about the meaning of a word by 
seeing where it is rated on these three main dimensions. 

There are, of course, limits to the meaning that can be 
given merely by placing a word the proper distance between good 
and bad, between strong and weak, and between active and pas- 
sive. Some concrete descriptive words, such as “yellow” or “wet,” 
have a good deal of meaning not accounted for by the three pri- 
mary dimensions. Nevertheless, it is striking how much meaning 
is carried by the three, and especially by the evaluative dimension. 
How much real difference in meaning is there, for example, be- 
tween “good,” “fair,” “honest,” and “upstanding”? To the aver- 
age person, probably very little, since they all are concerned pri- 
marily with evaluation. Whatever its limitations, the semantic dif- 
ferential has served both as a useful tool in practical studies and as 
a valuable approach to the quantitative understanding of mean- 
ing. Though a number of stimulus-response theorists have sug- 
gested that thinking involves the operation of mediating responses, 
Osgood’s work on meaning is by far the most ambitious attempt 
to study the implications of this view in quantitative terms. 


THEORIES BASED 
ON MATHEMATICS 
AND ENGINEERING 


AS THEORIES OF LEARNING become increasingly formal and attempt 
to make more exact quantitative statements about behavior, they 
tend increasingly to adopt language and concepts from mathematics 
and from more precise branches of science and technology. We 
have seen examples of this tendency in Lewin’s topological life 
space, in Tolman’s vectors, and particularly in Hull’s and Spence’s 
equations. Some theorists have gone farther than these in making 
mathematical analyses or concepts from engineering central to their 
theories. Such theories have been particularly prominent in recent 
years, reflecting the increasing use in modern society of computers, 
servomechanisms, and probabilistic interpretations of events. 
Beginning about 1950, there was a burst of interest in a statisti- 
cal approach to learning. This approach assumes that the processes 
in learning are not entirely predictable, but vary randomly in cer- 
tain ways that can be described statistically. Just as actuaries 
cannot predict who will marry or die this year, but can predict 
within a small margin of error how many marriages or deaths there 
will be, so statistical learning theorists do not claim to predict what 
a given learner will do on a given trial, but do claim to predict the 
average behavior of one statistical individual over many trials or of 
many statistical individuals on one trial. Whereas Hull worked out 
a system for making exact predictions and then added the factor 
of oscillation to show the inevitable random component in be- 
havior, the statistical theorists make this randomness the basis of 
their systems. One might satirize the difference in approach by say- 
ing that Hull started with perfect order and eventually had to make 
a concession to chaos, while the statistical theorists start with chaos 
and proceed to find order in it! 


ESTES’ STIMULUS-SAMPLING MODEL 


Since the several statistical models that appeared in the early 
1950’s had a lot in common, we will look at only one of them, the 
one developed by William K. Estes (b. 1919), who began his 
career as a statistical learning theorist at the University of Indiana, 
pursued it for some years at Stanford, and is now at Rockefeller 
University. His original system and a number of elaborations are 
presented in Estes (1959). 

Estes studied with Skinner during the period that Skinner was 
at the University of Minnesota, and he took his doctor-of-philoso- 
phy degree there in 1943. His work with Skinner on the effects of 
punishment was an important contribution to Skinner’s thinking 
on that topic. However, his interest in building mathematical 
models of learning is of course a departure from Skinner’s antithe- 
oretical bias. Moreover, the assumptions in Estes’ theory seem to 
show the influence of Guthrie, with whom he did not study, more 
than of Skinner. 

Estes’ system may be spoken of as a model of learning because, 
at least at the beginning, it makes no attempt to be a complete, all- 
encompassing theory. In this respect it is more modest than Guth- 
rie’s, Skinner’s, or Hull’s, and perhaps reflects some of the same 
concern about boundary conditions that Spence expresses. Rather 
the model is a very simplified statement of assumptions from 
which a few aspects of learning can be predicted with (hopefully) 
a good deal of accuracy. It is a model in the same sense as a three- 
dimensional model of an atom, with wooden balls for electrons, 
protons, and neutrons. No one claims that such a classroom model 
is a complete and accurate representation of an atom; we know 
very well that electrons are different from wooden balls and that 
their orbits have little in common with pieces of wire. Nevertheless 
there are certain respects in which the model and the real atom 
are alike. Given these similarities, the model enables us to predict 
certain things about the way the atom itself will behave. To the 
extent that a model enables us to predict some aspect of reality, 
it is useful. We need not argue whether or not it is true, since at 
best it is never more than a partial representation. This is much 
the same logic of theory construction that Hull used, but Estes car- 
ries it farther, beginning with a simple model and expanding 
it gradually at the same time that he is testing its usefulness. 

One way to look at Estes’ model is as an attempt to make cer- 
tain ideas of Guthrie’s more precise, to turn a part of Guthrie’s 
broad, practically oriented theory into a model appropriate for 
laboratory study. The reader will remember that Guthrie re- 
garded the learning of a skill as the conditioning of a great many 
particular stimulus-response connections. Estes simplified this 
suggestion by grouping all possible responses into two categories: 
those that produce a given outcome and those that do not. Thus, 
in Guthrie’s terms, he is concerned with acts rather than move- 
ments. Estes would record, for example, only whether a basketball 
player successfully shoots a basket or not, without regard to the 
innumerable muscle contractions that go to make up either out- 
come. Similarly, a circus dog either jumps through the hoop or 
does not, regardless of whether the jump is graceful or clumsy and 
regardless of whether not jumping through involves jumping and 
missing, sitting quietly, or running away. In this way Guthrie’s 
concern with what the learner does is changed into a concern with 
what he accomplishes, with the successful and unsuccessful out- 
comes of his behavior. (Note, however, that success is defined by 
the observer, not necessarily by any purpose of the learner’s.) We 
may call these two classes of responses (or, strictly speaking, of 
response outcomes) A1 and A2. 


Stimulus Elements 


Having simplified the matter of responses, Estes is prepared to 
face the complexity of stimuli. He regards any stimulus situation 
as made up of many small stimulus elements. In principle, such an 
assumption is clearly realistic. At any given moment we are being 
bombarded by a great number and variety of stimuli. As we read 
a book, we may be stimulated by the words on the page, the pres- 
sure of the book on our hands, the furniture around us, and the 
sound of a passing car, to give only a very incomplete list. Alto- 
gether these make up the stimulus complex that influences our 
behavior. However, if we try to analyze this complex into specific 
components, we immediately find ourselves in difficulty. Is the 
pressure of the book on our hands one element, or is it ten elements 
for our ten fingers? If the temperature and the texture of the book’s 
cover enter into the sensation, are these additional elements? It is 
a good example of Estes’ model building that he does not concern 
himself with this question. If it is reasonable to suppose that there 
are such stimulus elements, Estes does not care just what the ele- 
ments are, or how many there are, or how you decide where one 
element ends and another begins. He wants to know whether 
learning acts as if it is under the influence of such elements, not 
whether such elements “really” exist in the sense of being separately 
identifiable. 

Another problem about this analysis of stimulus complexes 
into elements is that it implies that these elements operate sepa- 
rately, that there are no effects of patterns. Since we don’t know 
what the elements are, it is hard to judge just how unrealistic this as- 
sumption is, but it certainly seems likely that there must be pat- 
terning effects of some sort. Estes eventually considered this issue, 
as we will see, but his original model used the assumption that 
there are no patterning effects, that each element operates inde- 
pendently of each other one. Again the question is not whether 
this assumption is true but whether it is useful. To answer this 
question we must wait until we see the predictions Estes has been 
able to make from a model that included this assumption. 

So far Estes has divided all the possible responses in a given sit- 
uation into two classes, and he has divided all the possible as- 
pects of the stimulus situation into an unspecified large number of 
elements. He now further assumes that each element is conditioned 
to one or the other of the two response classes. In other words, each 
stimulus element tends to evoke either A1 or A2. An element can- 
not be conditioned at the same time to both A1 and A2, nor can 
it ever be conditioned to neither. (If it seems unreasonable that 
all elements must be conditioned to one or the other of these re- 
sponse classes, keep in mind that all possible responses are included 
in one or the other class.) Accordingly, at any given moment, all 
elements can be classified as either conditioned to A1 or con- 
ditioned to A2. 

The use of the word “conditioned” in this connection does not 
mean that there has necessarily been prior learning. It might be 
clearer just to say that every element is “attached” to one or the 
other response class, so that that stimulus element tends to pro- 
duce that response. However, these connections are subject to 
change, with stimulus elements which were formerly attached to 
A1 becoming attached to A2, or vice versa. In fact, for Estes such 
changes are what is meant by learning. These changes are a proc- 
ess of conditioning, and hence Estes speaks of an element as being 
conditioned to a response when it tends to elicit that response. 


Stimulus Sampling and Conditioning 


To understand how the conditioning process operates in Estes’ 
model, we must first look at the process of stimulus sampling. Es- 
tes assumes that only a small part of the stimulus elements in a 
given situation actually influence the response on any given trial. 
Estes refers to all of the stimulus elements in a given situation, taken 
collectively, as S. At any particular moment, only a small fraction 
of the elements in S are actually effective in determining the re- 
sponse. Which elements will they be? Estes assumes that the indi- 
vidual samples elements randomly from S. We need not assume 
that this sampling process is conscious and deliberate, as though 
the individual were window shopping before an array of stimuli. 
The sampling may be explained by momentary changes either in 
the environment or in the individual’s attention. Again, however, it 
is important to note that Estes does not try to explain just how this 
sampling occurs. He assumes that it does occur, and whether it is 
a good or a bad assumption depends on whether it leads to valid 
predictions. 

Each element has a certain probability of being sampled on 
a given trial. It is mathematically simpler to assume that all 
elements have the same probability of being sampled. Since the 
whole concept of stimulus elements is hypothetical anyway, 
there should be no objection to assuming that these probabilities 
are equal. Moreover, Estes has demonstrated that, at least for cer- 
tain purposes, it makes very little difference whether one makes this 
assumption or not. He therefore assigns the Greek letter θ (theta) 
to represent the probability that any given element will be sam- 
pled on any given trial. Put another way, θ represents the propor- 
tion of all the elements in S that are sampled on any one trial. 

On successive trials, different groups of elements are sampled. 
This sampling is random, so that each element has the same 
chance of being sampled on each trial. Whether or not an element 
was sampled last time has no effect on its chance of being sampled 
this time. Moreover, whether or not any one element is sampled 
this time has no effect on any other element’s chances this time. 
Hence the degree of overlap among successive samples is deter- 
mined solely by chance. This assumption, like so many others in 
Estes’ model, may seem unrealistic if we think of stimulus ele- 
ments as particular identifiable sources of sensations. The impor- 
tant question, however, is whether Estes can use these oversimpli- 
fied assumptions to make correct predictions about learning. 

Let us consider two successive trials. For the sake of simplicity, 
let us say that in this situation θ is equal to 0.1. This means that 
one-tenth of the elements in S are sampled on each trial. Since 
the sampling is random, we can find the proportion of elements 
sampled on both trials by multiplying .1 times .1 to obtain .01. 
This means that one-hundredth of the elements in S are sampled 
on both trials. How many are not sampled on either trial? Since .9 
of the elements (1 minus .1) are not sampled on the first trial, and 
.9 are not sampled on the second, we find the proportion not sam- 
pled on either trial by taking .9 times .9, which gives .81. By sub- 
tracting .81 from 1.0, we find that .19 of the elements were sam- 
pled on at least one of the two trials. 
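
The arithmetic in this example can be checked with a short Python sketch. The function name is hypothetical; the only assumption carried over from the text is that sampling on the two trials is independent, so that proportions multiply.

```python
def two_trial_proportions(theta):
    """Proportions of elements in S sampled on both, neither, or at least
    one of two trials, given independent sampling with probability theta."""
    both = theta * theta
    neither = (1 - theta) * (1 - theta)
    at_least_once = 1 - neither
    return both, neither, at_least_once


both, neither, at_least_once = two_trial_proportions(0.1)
print(round(both, 2), round(neither, 2), round(at_least_once, 2))
# 0.01 0.81 0.19
```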

The stimulus elements sampled on a given trial are the ones that 
determine which response, A1 or A2, will occur on that trial. But 
if some elements are conditioned to A1 and others to A2, which of 
these two responses will occur? The answer to this question shows 
why this is called a statistical learning theory. Estes does not tell us 
definitely which response will occur; instead he gives the answer 
in the form of a probability statement. The probability of A1 is 
equal to the proportion of the elements in the sample that are con- 
ditioned to A1. Thus if 60 per cent of the elements that are sam- 
pled on a given trial are conditioned to A1 and the other 40 per 
cent to A2, there are 60 chances in 100 that A1 will occur on that 
trial and 40 chances that A2 will occur. Since the sample is drawn 
randomly from S, we could just as well say that the probability of 
A1 occurring on that trial is equal to the proportion of all the ele- 
ments in S that are conditioned to A1. 

Since the prediction of which response will occur is in the form 
of a probability, we can never say for sure which response will 
occur (unless the probability of A1 is either 1.00 or 0). This un- 
certainty of prediction may seem like a considerable weakness. 
However, Estes is really only saying, a bit more bluntly, what the 
other theorists we have considered also said. Guthrie said that the 
combination of stimuli would “tend” to be followed by the move- 
ment. Skinner discussed the factors controlling rate of emission of 
an operant, but he never tried to predict exactly what that rate 
would be. Hull, as we have noted, accomplished the same thing 
by including the concept of oscillation. Thus predictions from 
Estes’ theory are no less definite than those from other theories. 
He has, however, put the whole theory in terms of probabilities: 
the probability that an element will be sampled and the probability 
that a given response will occur. (These probabilities can in prin- 
ciple be converted into speeds if desired.) Since probabilities are 
analyzed by statistical techniques, Estes’ theory is called a statistical 
theory. 

In his original formulation, Estes assumed that learning is by the 
principle of contiguity. This relation means, in his formulation, that 
whenever a response occurs, all the stimulus elements sampled on 
that trial become conditioned to that response. Nothing is said about 
reinforcement as such. This formulation is particularly well suited 
to classical conditioning. Consider a case of eyelid conditioning. 
We will call an eyeblink response A1 and the absence of a blink 
A2, so that on every trial either A1 or A2 but not both will occur. 
On the training trials the blink always occurs, since the uncon- 
ditioned stimulus is always presented. As a result, all the stimulus 
elements sampled on a given trial become conditioned on that trial 
to A1 (the response of blinking). On successive trials, additional 
elements are sampled and become conditioned to A1, so that the 
proportion of the elements in S that are conditioned to A1 keeps 
increasing. Whenever an element is conditioned to A1 it is of 
course also unconditioned to (or “disconnected from”) A2. 
Thus conditioning is a process of transferring elements from being 
conditioned to A2 to being conditioned to A1. As this process goes 
on, the probability of a conditioned eyeblink response (A1) 
increases. The unconditioned blink occurs on every trial, since it is 
elicited by the unconditioned stimulus, but the conditioned blink 
may or may not occur, its probability depending on the proportion 
of elements conditioned to A1. 


Learning Curves 


With all this background of assumptions, we are finally ready to 
look at the predictions which follow from them. First, what of the 
shape of the learning curve? Suppose we take the situation in 
which, at the beginning of learning, all the elements in S are condi- 
tioned to A2, so that the probability of a conditioned A1 response is 
zero. This case would be very near that in eyelid conditioning, 
where the chance that the subject will make a response to the con- 
ditioned stimulus before conditioning has begun is quite small. 
This condition means that all the elements sampled on the first trial 
will already be conditioned to A2 (not responding). However, 
on this trial they will become conditioned to A1. On the next trial 
another sample will be drawn. Most of the elements in this new 
sample will be conditioned to A2, but a few will be conditioned to 
A1 because they will have been sampled, and hence conditioned to 
A1, on the previous trial. Only those that are attached to A2 can 
become conditioned to A1, since the rest are already attached to 
A1. Thus the number of elements transferred from A2 to A1 will 
be less on the second trial than on the first. (Remember that the 
total number of elements sampled is the same on all trials.) On the 
third trial still more of the elements that are sampled will already 
have been conditioned to A1, so that even fewer will be transferred 
from A2 to A1. Learning will continue in this fashion, with fewer 
elements being transferred from A2 to A1 on each successive trial, 
until all the elements in S are conditioned to A1. At that point, no 
further learning can take place. 
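
The process just described can be imitated with a small simulation. What follows is only a sketch under stated assumptions: the number of elements (1,000), the value of theta (0.1), and the random seed are arbitrary choices, and each element is sampled independently with probability theta, so that the number sampled varies slightly from trial to trial instead of being exactly the same on every trial.

```python
import random


def simulate_acquisition(n_elements=1000, theta=0.1, n_trials=10, seed=0):
    """Simulate acquisition when every trial ends with response A1.

    All elements start conditioned to A2. On each trial every element is
    sampled with probability theta, and every sampled element becomes
    (or stays) conditioned to A1. Returns the proportion of elements
    conditioned to A1 after each trial.
    """
    rng = random.Random(seed)
    conditioned_to_a1 = [False] * n_elements
    proportions = []
    for _ in range(n_trials):
        for i in range(n_elements):
            if rng.random() < theta:
                conditioned_to_a1[i] = True
        proportions.append(sum(conditioned_to_a1) / n_elements)
    return proportions


props = simulate_acquisition()
for trial, p in enumerate(props, start=1):
    print(trial, round(p, 3))
```

The printed proportions rise quickly at first and then more slowly, because more and more of the elements sampled on later trials are already conditioned to A1.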

If we know the value of θ, we can make predictions about the 
proportion of elements in S that will be conditioned to A1 at the 
end of each trial. Let us return to the previous discussion of sam- 
pling in which we assumed for convenience that θ was equal to 
0.1. At the end of the first trial, 0.1 of the elements in S had been 
sampled once. As a result, 0.1 of the elements were then attached 
to A1. After two trials, .19 of the elements had been sampled at 
least once, so that .19 of the elements were then conditioned to A1. 
The increase in the proportion of elements conditioned to A1 was 
.1 as a result of the first trial and .09 as a result of the second. In 
both cases, this increase was 0.1 of the elements still conditioned 
to A2. In other words, on each trial, 10 per cent of the elements still 
conditioned to A2 were transferred to A1. And this value, 0.1, 
is the value of θ. If we continued these calculations further, we 
would see that this ratio fits a general principle: on each trial the 
proportion of elements transferred from A2 to A1 is equal to θ 
times the proportion still conditioned to A2. However, since fewer 
and fewer are still conditioned to A2 on each trial, the actual num- 
ber of elements transferred from A2 to A1 is less than on the pre- 
vious trial. 
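
The general principle can be written as a trial-by-trial rule and, equivalently, as a formula for the proportion conditioned after any number of trials. The Python sketch below shows both. The function names and the symbol p0 (the proportion already conditioned to A1 before training) are introduced here for illustration; the closed form is simply what repeated application of the rule gives.

```python
def learning_curve(theta, n_trials, p0=0.0):
    """Apply the rule p_next = p + theta * (1 - p) trial by trial.

    p is the proportion of elements conditioned to A1, so (1 - p) is the
    proportion still conditioned to A2.
    """
    p = p0
    curve = []
    for _ in range(n_trials):
        p = p + theta * (1 - p)
        curve.append(p)
    return curve


def closed_form(theta, n, p0=0.0):
    """Proportion conditioned to A1 after n trials: 1 - (1 - p0)(1 - theta)^n."""
    return 1 - (1 - p0) * (1 - theta) ** n


print([round(p, 3) for p in learning_curve(0.1, 3)])  # [0.1, 0.19, 0.271]
```

With theta equal to 0.1 the first two values reproduce the .1 and .19 worked out above, and the successive gains (.1, .09, .081) shrink on every trial.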

This formulation has resemblances to both Guthrie’s and Hull’s 
discussions of the learning curve. Guthrie assumed that condition- 
ing of a movement to a stimulus occurs on one trial, but that 
conditioning of an act requires many trials because it is made up 
of many stimulus-movement connections. Similarly, Estes as- 
sumes that an element is conditioned to A1 on one trial, but that 
the change in probability of A1 is gradual because there are 
so many elements to be conditioned. Thus in this respect Estes has 
taken an idea very much like Guthrie’s and changed it from a 
loose verbal interpretation to a precise mathematical one. The math- 
ematical description of the learning curve then turns out to be 
identical with Hull’s. The advantage of Estes’ formulation over 
Hull’s is that instead of simply postulating that the curve should 
have this shape, he has derived this curve as a prediction from the 
more basic assumptions of sampling. He has, in a sense, out-Hulled 
Hull. 


Other Aspects of Learning 


Once a reader has grasped the fundamental assumptions 
(“postulates,” in Hull’s terms) of Estes’ system, he can exercise 
his ingenuity by trying to predict how Estes will deal with vari- 
ous phenomena of learning. The phenomenon of generalization 
poses no problem. If a response has been learned to one set of stim- 
uli, generalization is indicated by the occurrence of that same re- 
sponse when another, similar set of stimuli is presented. For Estes, 
a “similar set of stimuli” is one which has a number of the same 
stimulus elements as the original training set. If all of the elements 
in the first set are conditioned to A1, and if half of the elements in 
the second set are also in the first set, then at least half of the ele- 
ments in the second set are conditioned to A1. The probability of 
an A1 response to the second stimulus set cannot then be less than 
.50. 

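
The limits on generalization in this example can be sketched as follows. The function and parameter names are hypothetical, and the upper limit rests on an assumption not stated in the text: that nothing is known about the elements found only in the second set, so that anywhere from none to all of them might be conditioned to A1.

```python
def a1_probability_bounds(shared_fraction, p_shared=1.0):
    """Lower and upper limits on the probability of A1 to a second stimulus set.

    shared_fraction: proportion of the second set's elements that also
        belong to the training set.
    p_shared: proportion of those shared elements conditioned to A1.
    """
    lower = shared_fraction * p_shared       # unshared elements all on A2
    upper = lower + (1 - shared_fraction)    # unshared elements all on A1
    return lower, upper


print(a1_probability_bounds(0.5))  # (0.5, 1.0)
```
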
Discrimination is more of a problem. If two stimulus sets have 
a number of elements in common, how can an individual learn 
consistently to give response A1 to one of these sets and response 
A2 to the other? For such perfect discrimination to be established, 
it would be necessary for all the elements in one set to be condi- 
tioned to A1 and all those in the other set to be conditioned to A2. 
This condition, however, is impossible, since some of the same ele- 
ments are in both sets. Estes is not yet satisfied that he has found a 
completely adequate way of dealing with this problem. Probably the 
best suggestion has come from another statistical learning theo- 
rist, Frank Restle. Restle suggests that stimulus elements which are 
common to both sets become neutralized, that is, not conditioned to 
either A1 or A2. They are then irrelevant to the discrimination. 
The acceptance of this idea calls for some modification of the Es- 
tes model, but it seems difficult to avoid some such interpretation. 
Perhaps we need to consider not only responses A1 and A2 but also 
other kinds of responses that involve paying attention to certain 
stimulus elements and not others. This rather cognitive suggestion 
is consistent with the views that we considered in Chapter 6. 

Drive should be another easy concept to predict. For Estes, 
as for Guthrie and for Miller, drive is a stimulus (or, more precisely, 
a set of stimulus elements). Estes’ principal contribution in this 
regard has been to argue that Hull’s intervening variable D is 
unnecessary. Hull used D to refer to the general energizing func- 
tion of all drives. In an ingenious mathematical example, Estes 
showed how what appears to be an over-all energizing effect 
of drive could be explained as simply a matter of the number of 
drive stimulus elements conditioned to different responses. This 
revision does not necessarily mean that drives do not have an en- 
ergizing function, since Estes’ example is hypothetical, but it 
throws the burden of the argument on Hull’s supporters. 

Extinction, for Estes, is simply another example of condition- 
ing. If in acquisition, stimulus elements are conditioned to response 
class A1, then in extinction they are reconditioned to A2. The 
laws of learning are exactly the same in acquisition and extinc- 
tion. This interpretation is essentially the same as Guthrie’s. Guth- 
rie says that extinction is learning to do something else, which 
raises the question, “what else?” Estes can dodge this question 
because, by definition, any response that is not A1 is A2. In ex- 
tinction, A1 is being replaced by something else, but Estes con- 
siders only the rate at which that replacement goes on, not the 
details of what the “something else” looks like. 


Forgetting and Spontaneous Recovery 


Spontaneous recovery is a bit more of a challenge, but again 
Estes deals with it in a typically Guthrian fashion. To do so, he 
has to consider a larger category of stimulus elements than be- 
fore. He assumes that S, the set of stimulus elements from which a 
learner samples in a given learning situation, is only a subset of 
all the elements that are sometimes available for sampling in that 
situation. In other words, if we consider all the stimulus elements 
that might be available for sampling in a certain learning situation, 
only some of them will actually be available for sampling on any 
given occasion. This seems like a quite plausible assumption. 
There are many characteristics of learners, teachers, and other as- 
pects of a learning situation that vary from one occasion to an- 
other. If the learner is more tired or more distracted on one occa- 
sion than on another, for example, this difference could be expected 
to make some difference in what stimulus elements were available 
for sampling on the two occasions. We can therefore divide all the 
stimulus elements that might be available during this learning into 
two categories: S (those available on a given occasion) and S’ (those 
not available on this particular occasion but potentially available on 
others). On any two occasions, some elements will be in S on both 
occasions, some in S the first time and in S’ the second, some in 
S’ the first time and in S the second, and some in S’ on both. This 
assumption makes it possible to predict not only spontaneous re- 
covery but also various other phenomena of learning. 

Consider a learning situation in which originally all stimulus ele- 
ments are conditioned to A2 (see Fig. 7, part A). In the course of 
conditioning, all of the elements in S are sampled at some time and 
become conditioned to A1. The next day the individual is tested 
for retention. By then, 20 per cent of the elements in S have moved 
into S’ (i.e., are no longer available for sampling) and have been 
replaced by elements that were formerly in S’ (Fig. 7, part B). 
Since all of these new elements in S are still conditioned to A2, 
20 per cent of the elements now in S are conditioned to A2. As a 
result, the probability of an A1 response drops from 1.00 at the 
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to A1. Unless there was additional training involving some of the 
same elements, forgetting would thereafter be impossible. (This 
is an untestable assumption, since we could never be sure that no 
such training had taken place.) Suppose now that extinction 
is carried out for one day until all elements in S are conditioned 
to A2. If 20 per cent of the elements in S move into S’ before the 
next test and are replaced, S will then be made up of 80 per cent 
elements conditioned to A2 and 20 per cent conditioned to A1. 
This means that 20 per cent of the response strength before extinc- 
tion will have recovered spontaneously. A whole series of extinctions 
on successive days would be necessary in order to prevent any 
further spontaneous recovery. 
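
The arithmetic of this fluctuation account can be sketched in a few lines of Python. This is only an illustration under stated assumptions: S and S’ are taken to be the same size, the elements exchanged are treated as a random sample of each set, and the function names (exchange, forgetting, recovery_over_days) are invented here rather than taken from Estes.

```python
def exchange(p_s, p_reserve, fraction=0.20):
    """Trade `fraction` of the elements between S and S'.

    p_s and p_reserve are the proportions of elements conditioned to A1
    in S and in S'. Assumption for illustration: the two sets are the
    same size and the traded elements are a random sample of each.
    """
    new_s = (1 - fraction) * p_s + fraction * p_reserve
    new_reserve = (1 - fraction) * p_reserve + fraction * p_s
    return new_s, new_reserve


def forgetting():
    """One day of training, then an overnight exchange of elements."""
    p_s, p_reserve = 1.0, 0.0        # S fully conditioned to A1, S' still A2
    p_s, p_reserve = exchange(p_s, p_reserve)
    return p_s                       # probability of A1 on the retention test


def recovery_over_days(days):
    """Daily extinction sessions after every element has been conditioned to A1."""
    p_s, p_reserve = 1.0, 1.0
    recovered = []
    for _ in range(days):
        p_s = 0.0                    # extinction: all of S goes to A2
        p_s, p_reserve = exchange(p_s, p_reserve)
        recovered.append(p_s)        # probability of A1 at the next day's test
    return recovered
```

Under these assumptions forgetting() gives .80, and recovery_over_days(3) gives roughly .20, .16, and .128: each further day of extinction leaves less to recover, but recovery does not vanish after a single session.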

This formulation of spontaneous recovery and forgetting em- 
phasizes the importance of practice over a series of occasions. It 
takes not merely a lot of practice, but practice spaced in time to 
give maximum possible learning. Herein rests one of the diffi- 
culties with cramming and with very concentrated courses. Es- 
tes’ theory, like Guthrie’s, thus supports the view held by many 
educators that learning is most effective when there is time enough 
to spread the process out. Reviewing learned material every now 
and then, or taking a series of courses in which each begins with 
a review of the previous one, are procedures that fit well with this 
interpretation of learning. 


Reinforcement 


What does Estes have to say about reinforcement? Relative 
to many other theorists, not much. So far as he is concerned, a rein- 
forcer is simply any event that determines to which response class 
the stimulus elements that are sampled will get conditioned. What 
other properties a reinforcer may have, such as perhaps being a 
reduction in drive, do not interest him. He is thus somewhat like 
Skinner in the way he interprets reinforcement, but he puts less 
emphasis on the importance of the topic than does Skinner. 

Estes’ treatment of reinforcement also resembles Skinner’s in 
another respect: both are greatly interested in intermittent rein- 
forcement. What happens, asks Estes, if some of the trials in a 
series end with reinforcement (thus conditioning all the elements 
sampled on that trial to A1) and other trials end with nonrein- 
forcement (conditioning all the sampled elements to A2)? A fa- 
vorite learning situation for answering this question is the light- 
guessing experiment. Here A1 and A2 are defined, not as mak- 
ing or not making a given response, but as making one of two pos- 
sible guesses. The learner is seated in front of a board with two 
light bulbs on it. On each trial he is instructed to guess which of the 
two lights will be turned on. After he guesses, one and only one of 
the lights does go on, and which one it is does not depend on 
his guess. The two possible guesses are response classes A1 and 
A2, and the two possible events (the left and the right light going 
on) are called E1 and E2. Thus E1 reinforces A1 and E2 rein- 
forces A2. However, Estes assumes that E1 conditions all the 
sampled elements to A1 and E2 conditions all the sampled ele- 
ments to A2, regardless of which response occurs on that trial. 
In other words, the left light going on reinforces the response of 
guessing “left,” even if the guess the learner made was “right.” 
This assumption that a response which does not occur on a given 
trial can nevertheless be reinforced on that trial is a departure 
from the usual assumptions of connectionist reinforcement the- 
ories. 

If E1 always occurs in this situation, the individual gradually 
learns to make A1 consistently. (He also rapidly becomes bored 
with the experiment.) What if E1 occurs 75 per cent of the time 
and E2 25 per cent of the time, in a random order so that the 
learner can never be sure which light will go on on any given 
trial? What usually happens is that he comes to make A1 75 per 
cent of the time and A2 25 per cent of the time. Any reader who is 
ambitious enough to work through the mathematics can confirm 
for himself that this is just what Estes’ model predicts. (To do so, 
the reader will have to ask himself what per cent of the sampled ele- 
ments will be conditioned to A1 by E1 or to A2 by E2, when the 
probability of A1 is greater than, equal to, and less than .75. It 
will take a little thinking, but this opportunity to obtain an insight 
experience while studying a connectionist theory really should not 
be missed.) 
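
For the reader who would rather check the arithmetic by machine, the following Python sketch works through the same question. It assumes, for illustration, that a fixed proportion theta of the elements is sampled on every trial and that the probability of A1 equals the proportion of elements conditioned to A1; the function names are hypothetical.

```python
def expected_p_a1(p, pi, theta):
    """Expected probability of A1 after one more trial.

    p      current probability of A1
    pi     probability that E1 (rather than E2) occurs on the trial
    theta  proportion of elements sampled on each trial (assumed fixed)
    """
    after_e1 = p + theta * (1 - p)   # sampled elements all become A1
    after_e2 = p - theta * p         # sampled elements all become A2
    return pi * after_e1 + (1 - pi) * after_e2


def iterate(p, pi, theta, trials):
    """Apply the expected one-trial change repeatedly."""
    for _ in range(trials):
        p = expected_p_a1(p, pi, theta)
    return p
```

With pi = .75, a probability above .75 is expected to fall, a probability below .75 is expected to rise, and a probability of exactly .75 is expected to stay where it is, so the expected value settles at .75 whatever the starting point.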

This finding, so satisfying to Estes, has disconcerted many other 
theorists because of its apparent irrationality. If the learner wants 
to guess correctly as often as possible, his best strategy is to guess 
the more frequent event all the time. A cognitive theorist might ex- 
pect him to follow such a rational strategy, since it would be the 
best possible insightful solution to the problem. Instead, however, 
subjects follow the irrational behavior that is predicted by Estes’ 
conditioning model. Why? Many experiments have been done 
and many words written in attempts to explain this “irrational” 
behavior to other people’s satisfaction. (Some situations have been 
found in which learners do adopt the “rational” solution, but they 
are not typical.) Estes, however, can take pride in having correctly 
predicted an unexpected finding. 
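
How much the matching strategy costs the learner can be computed directly. The short Python sketch below assumes that guesses and lights are independent from trial to trial; proportion_correct is a name made up for this illustration.

```python
def proportion_correct(p_guess_left, p_left_light):
    """Expected proportion of correct guesses when guess and light are independent."""
    both_left = p_guess_left * p_left_light
    both_right = (1 - p_guess_left) * (1 - p_left_light)
    return both_left + both_right


matching = proportion_correct(0.75, 0.75)    # guess left 75 per cent of the time
maximizing = proportion_correct(1.0, 0.75)   # always guess the more frequent light
```

Under that assumption the matching learner is right on 62.5 per cent of the trials, while the learner who always guesses the more frequent light is right on 75 per cent.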

Another aspect of the theory has been confirmed in a different 
version of the light-guessing experiment. The object of this experi- 
ment was to provide a more direct test of the assumption that the 
probability of A1 is equal to the per cent of sampled elements that 
are conditioned to A1. Ordinarily we have no way of identify- 
ing the stimulus elements in any situation. This problem was par- 
tially solved by giving learners in the light-guessing situation a 
hint before each guess. The hint was a pattern of small lights on 
a board, separate from the lights being guessed. Some of these 
small lights were turned on only on trials when the left light was 
going to be correct, others only on trials when the right light would 
be correct. Thus some lights signaled E1 and others E2. These 
hints made it possible for the learner eventually to predict cor- 
rectly every time which light would go on. This component made 
the experiment an example of discrimination learning. When the 
discrimination learning was complete, a test was given in which 
some of the signal lights from both sets were given before the 
learner made his guess. If each of the small signal lights acted as a 
stimulus element, what should Estes predict? The probability of 
guessing A1 should be equal to the per cent of signal lights on the 
test trial that had previously signaled E1. If, for example, two- 
thirds of the signal lights on the test trial were ones that had pre- 
viously signaled E1, and one-third were lights that had previously 
signaled E2, A1 should occur two-thirds of the time. This and 
several other such predictions were confirmed in this experiment, 
thus giving additional support to Estes’ assumption about stimu- 
lus elements and response probabilities. 

How does Estes’ approach compare with those of other theorists 
we have considered? In his approach to theory construction he is 
most like Spence, building an abstract formal theory, but proceed- 
ing slowly, with a close eye both on new data and on the boundary 
conditions. However, he has put more emphasis on stimulus factors 
and less on the nature of reinforcement than has Spence, and the 
kinds of experiments he has done are rather different from 
Spence’s. His interpretations of most topics are much like Guthrie’s, 
but more detailed mathematically than Guthrie’s. His interpretation 
of reinforcement, however, is most like Skinner’s. 
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A1 to always giving it. Only for the average of a group would the 
increase in probability appear gradual. 

Psychologists are so used to averaging learning curves from 
groups of subjects, in order to smooth out the fluctuations that 
individual curves commonly show, that they often do not even look 
at the data which might tell them whether individual learning 
curves more closely approximate the gradual or the sudden form. 
On the other hand, Skinner and his followers had criticized the use 
of averaged curves, Guthrie had emphasized all-or-none learning at 
the molecular level, and the gestalt psychologists had studied cases 
of sudden insight. Then in the late 1950’s gradual vs. all-or-none 
learning became a major focus of controversy and research. 

Estes, having led in the earlier many-elements models, was also 
a leader in developing one-element models. He suggested that 
even though memorizing a list of items is gradual, the learning of 
any one of these items may well be all-or-none, and presented evi- 
dence in support of that view. The evidence has since been chal- 
lenged, but the view remains important. He also suggested that, 
in a one-element model, it is more appropriate to speak of a stimu- 
lus pattern than of a stimulus element; since the stimulus as a whole 
is what gets conditioned, it is the total configuration of the stimulus 
that is or is not sampled and attached to A1 on any given trial. 
Thus some of the flavor of gestalt theory enters into statistical 
learning theory. These newer views are summarized in Estes 
(1964). 
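
The point that averaging can make all-or-none learning look gradual is easy to verify with a small Python sketch. The five learners and the trials on which they happen to become conditioned are invented for the illustration, as are the function names.

```python
def individual_curve(jump_trial, trials):
    """Probability of A1 for one learner: 0 until `jump_trial`, 1 from then on."""
    return [0.0 if t < jump_trial else 1.0 for t in range(1, trials + 1)]


def group_average(curves):
    """Trial-by-trial mean across learners."""
    return [sum(values) / len(values) for values in zip(*curves)]


# Five hypothetical learners who become conditioned on different trials.
curves = [individual_curve(j, 8) for j in (2, 3, 3, 5, 7)]
average = group_average(curves)
```

Every individual curve here contains only the values 0 and 1, yet the average climbs by steps from 0 to 1 over the eight trials, which is just the kind of curve that would ordinarily be read as gradual learning.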

It should be evident from the above that there is not so much 
a controversy between gradual and all-or-none interpretations of 
learning as a question of which are appropriate for what kinds of 
learning. The simpler an item to be learned, the more likely it seems 
that it would be learned in an all-or-none fashion, while the more 
complex it is, the more likely it seems that multiple connections 
would have to be acquired and learning would therefore be gradual. 
But why limit ourselves to these two extremes: that learning is 
sudden and complete or that it is gradual and continuous? If we 
can have either a one-element or a many-element model, why not a 
two-element model? 

In a two-element model, learning would be expected to involve 
three stages: the initial stage in which neither element has been 
conditioned to A1, an intermediate state in which one element has 
and the other has not, and a third in which both have. The process 
of acquisition would therefore involve two sudden increases in the 
probability of making a correct response. There are numerous 
situations in which one might plausibly expect learning to be of 
this form. In conditioning, it may be necessary first to learn what 
to pay attention to and then what to do about it. In paired-asso- 
ciate memorizing, it may be necessary first to learn a response and 
then hook it up with the appropriate stimulus item. In each case 
the learner who has mastered the first part but not the second 
would be intermediate in performance between the learner who 
had mastered both and the learner who had mastered neither. 
Since one of these two learning tasks would have to be mastered 
before the other, the specific model of two stimulus elements wait- 
ing to be sampled seems a bit inappropriate, but the more general 
idea of randomly determined transitions from one state to another 
is quite appropriate. Various forms of learning do indeed seem to 
fit a two-element model. 
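
The idea of randomly determined transitions among three states can be sketched as follows. The transition chances a and b and the probability of a correct response in each state are arbitrary values chosen for illustration, not estimates from any experiment, and the function names are hypothetical.

```python
def state_probabilities(trials, a=0.3, b=0.3):
    """Probability of being in each of three states on each trial.

    State 0: neither part mastered; state 1: first part only; state 2: both.
    `a` is the chance per trial of moving 0 -> 1, `b` of moving 1 -> 2
    (both values are illustrative assumptions).
    """
    probs = (1.0, 0.0, 0.0)          # every learner starts in state 0
    history = [probs]
    for _ in range(trials - 1):
        p0, p1, p2 = probs
        probs = (p0 * (1 - a),
                 p0 * a + p1 * (1 - b),
                 p1 * b + p2)
        history.append(probs)
    return history


def group_curve(history, correct=(0.0, 0.5, 1.0)):
    """Average probability of a correct response on each trial."""
    return [sum(p * c for p, c in zip(probs, correct)) for probs in history]
```

Any single learner following these rules shows at most two sudden jumps in performance, while the curve from group_curve rises smoothly toward 1.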
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of most interest to cognitive psychologists. Statistical learning theory 
is an active, changing field today, and its eventual contribution re- 
mains to be seen. At least, it has provided psychology with a new 
set of conceptual tools and has perhaps pointed the way, by slow 
degrees, toward the kind of precise, general learning theory that 
Hull envisioned. 
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present instructional programs, and in various other ways. For 
theorists, a particularly interesting application has been the attempt 
to draw an analogy between the operation of computers and the 
operation of the human intellect. Computers carry on various 
processes that bear marked resemblances to human learning, re- 
membering, and thinking. Can the way that computers carry on 
these processes tell us anything about how humans do so? 

One fairly simple use of the computer to simulate learning has 
been in connection with Estes’ stimulus-sampling model and other 
statistical models of learning. In some applications of these models, 
such as to learning in the T maze, the reinforcement depends on 
what the learner does. It is not possible, therefore, to generate a 
simple curve for a “standard learner.” For example, with reinforce- 
ment on both arms of the T maze and with a large value of θ, a rat 
who happened to turn right on the first three trials might by then 
have developed such a high probability of turning right that he 
would never get over to the left side to be reinforced there, while 
one who initially happened to turn left might similarly end up 
“trapped” on the left. (The similarity to Skinner’s demonstrations 
of superstitious behavior should be evident.) When the learning 
curve of a given individual can follow so many different paths de- 
pending on chance, predicting the average learning curve and the 
variability of a group of learners is a very difficult mathematical 
operation. The easiest solution is to program a computer to gen- 
erate the predicted behavior (and hence learning curves) of a 
group of “stat rats,” that is, of imaginary rats that follow the model 
exactly (insofar as anything with a random component can be 
exact). The behavior of these statistical rats, as indicated by the 
output from the computer, can then be compared with the behavior 
of real rats, or of any other kind of learners one wishes to study. 
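
A present-day reader can generate such “stat rats” in a few lines of Python. The sketch below is not the program used in the studies referred to; it assumes that food is present on both arms, that each reinforced turn moves the probability of that turn a fraction theta of the way toward 1, and that every rat starts with an even chance of turning right. The function names are invented for the illustration.

```python
import random


def run_stat_rat(theta, trials, rng, p_right=0.5):
    """Simulate one imaginary rat in a T maze with food on both arms.

    Whichever arm the rat enters is reinforced, so the probability of
    that turn moves a fraction `theta` of the way toward 1.
    Returns the list of choices, "R" or "L".
    """
    choices = []
    for _ in range(trials):
        if rng.random() < p_right:
            choices.append("R")
            p_right += theta * (1 - p_right)
        else:
            choices.append("L")
            p_right -= theta * p_right
    return choices


def run_group(n_rats, theta, trials, seed=0):
    """Proportion of right turns on each trial for a group of stat rats."""
    rng = random.Random(seed)
    rats = [run_stat_rat(theta, trials, rng) for _ in range(n_rats)]
    return [sum(rat[t] == "R" for rat in rats) / n_rats for t in range(trials)]
```

With a large theta most of these rats settle on whichever side they happened to choose early, while the group proportion stays in the neighborhood of one-half, so the group curve says little about what any one rat is doing.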

For the most part, however, such relatively simple connectionist 
models of learning have not excited people who use the computer 
for the analysis of learning. The most intriguing examples of such 
simple learning analogs have not even bothered to use a computer. 
A small machine can be built, far less complicated than a com- 
puter, that will behave like a real organism in that it moves 
around, seeks certain goals, avoids others, and demonstrates simple 
forms of learning. Walter (1953) produced such a machine some 
years ago and named it (imitating biological taxonomy) Machina 
docilis (the teachable machine). This little machine would ride 
around on its wheels, approaching lights of moderate intensity and 
moving away from very bright lights. It could be conditioned to 
approach a whistle that was paired with the turning on of a 
“reinforcing” light, and this conditioning was subject to both ex- 
tinction and forgetting. It would avoid obstacles, show “fear” of a 
stimulus that had been paired with a kick, and behave ambivalently 
toward other members of its “species.” In the days before com- 
puters became widespread, when the behavior of machines was 
widely regarded as something completely different from the be- 
havior of living organisms, such a machine constituted quite a 
striking demonstration. Even after looking at its wiring diagram, one 
tended to feel that there was something magical about a “mere 
machine” behaving in a fashion so lifelike. Now, when computers 
have taken over many business operations and computerized robots 
are a stock-in-trade of movies and television, much of the magic 
has gone out of such a demonstration, but it has still not completely 
lost its power to beguile us. 

For the most part, computer analogs of learning have been of 
more complex, cognitive processes. A number of these are dis- 
cussed in Feigenbaum and Feldman (1963). One of the most 
interesting is the Logic Theorist of Newell, Shaw, and Simon. This 
is a computer program whose function is, given the postulates of 
geometry, to prove certain theorems. The manner in which it goes 
about doing so is quite comparable to what a good high-school 
geometry student does. It works forward from theorems already 
proved to try to find ways of turning one of them into the theorem 
to be proved. It works backward from t 
to other statements which 
then treats these statemen 


solving! 


What do these computer programs tell us about the actual 
processes of learning and thinking in living organisms? Nothing 
directly; the fact that a computer can be programmed to learn or 
think by certain processes does not demonstrate that either humans 
or rats learn or think by the same processes. What these programs 
do show are the implications of certain assumptions about how 
organisms learn. A theorist can write a computer program to follow 
the laws of learning that he thinks organisms follow, then run the 
program to see what happens. If what happens is similar to the 
outcomes of human or animal learning, this finding makes it more 
plausible that humans or animals do in fact follow those hypothe- 
sized processes when they learn. If what happens is quite different 
from the outcomes of “real” learning, the theorist’s assumptions 
are probably false. Thus the computers provide a more precise 
way of doing what Hull wanted to do; if he had made use of 
computers he might have avoided some of the contradictions into 
which his system fell. The role of the computer program is much 
like that of mathematical analysis—not to change the logic of 
theory construction, but to carry out that logic more precisely and 
more thoroughly. 


CYBERNETICS 


Another contribution from engineering is the application of 
feedback theory, or cybernetics, to the study of learning. The term 
“cybernetics” was coined by Norbert Wiener (1948) to refer to 
the study of control mechanisms. The word is derived from the 
Greek for “steersman,” since it is concerned with devices that keep 
some operation, like the sailing of a ship, on its proper course. In 
order to keep an operation on course, it is necessary to compensate 
for any deviation in either direction. If a ship drifts off course 
either to port or to starboard, the helmsman must move the rudder 
the proper amount and direction to bring the ship back on course. 
This illustration presents the general concept of negative feedback. 
Negative feedback involves adjustments in a system to keep it in a 
steady state by compensating for any deflections from that state. 

The concepts of cybernetics emerged from the branch of engi- 
neering concerned with control mechanisms. Such mechanisms 
operate by negative feedback. A thermostat is an example. When 
the temperature falls, the thermostat turns on the furnace and thus 
makes the temperature rise. When the temperature rises to a cer- 
tain point, the thermostat turns the furnace off and thus makes the 
temperature fall. In engineering terms, the thermostat and the 
furnace together make up a system. The temperature in the room 
is the input to the system, and the activity of the furnace is the out- 
put from the system. The effect of the output is fed back into the 
system as the input (hence the term feedback) and the effect of 
this feedback is to produce a change in the opposite direction from 
the input (so that the feedback is negative). There are many such 
mechanisms (they are often called servomechanisms), and cyber- 
netics is an attempt to apply their principles to problems in psy- 
chology and related fields. 
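
The thermostat-and-furnace system can be sketched in Python as a simple loop. The two switching temperatures and the rates of heating and cooling are arbitrary numbers chosen for illustration, and the function names are hypothetical.

```python
def thermostat_step(temperature, furnace_on, low=19.0, high=21.0):
    """Decide whether the furnace should run, given the room temperature."""
    if temperature <= low:
        return True                  # too cold: turn the furnace on
    if temperature >= high:
        return False                 # warm enough: turn the furnace off
    return furnace_on                # in between: leave the furnace as it is


def simulate_room(start, steps, heat_gain=0.5, heat_loss=0.3):
    """Feed the furnace's effect back into the thermostat, step by step."""
    temperature, furnace_on = start, False
    history = []
    for _ in range(steps):
        furnace_on = thermostat_step(temperature, furnace_on)
        temperature += heat_gain if furnace_on else -heat_loss
        history.append((round(temperature, 1), furnace_on))
    return history
```

Starting from a cold room, the temperature climbs until the furnace is switched off and thereafter moves back and forth within a narrow band, each change in output producing an input that calls for the opposite change.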

When a person performs any skilled act, he is constantly guided 
by sensations from his muscles, usually also from his eyes, and 
often from other sources. These sensations warn him whenever 
he is starting to make a mistake and thus enable him to return to 
the proper procedure. This response often happens so quickly 
and automatically that he is unaware of it, but it occurs none- 
theless. This behavior is an example of negative feedback, with the 
output being fed back to control the operation and keep it on course. 
The important part that knowledge of results plays in skilled per- 
formances can thus be expressed in terms of feedback. 

When we proceed to more complex purposive behaviors, the 
same principle applies. A studen 
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but also in analyzing the behavior of groups. The system it describes 
can as well be a club or a school class as an individual person. A 
variety of applications of cybernetics to behavior is presented by 
Smith and Smith (1966). 

The great weakness of feedback theory in the psychology of 
learning is that, like Lewin’s system, it is not really a theory 
of learning at all. Feedback language is useful for describing any 
kind of purposive behavior, and thus will often apply to situations 
in which learning is taking place. The theory does not, however, 
make any distinction between learning and nonlearning situa- 
tions. Consider, for example, an individual who is learning to use 
an axe. Each separate swing of the axe is controlled according to 
the feedback principle. If he observes that his blow will strike too 
high on the tree, he adjusts his swing to strike lower. This adjust- 
ment in itself is not learning. However, the axman also discovers 
that certain ways of standing, holding the axe, and swinging give 
better results than others. This discovery leads to changes within 
a session of chopping in the way he does these things. Both 
changes are examples of feedback. However, we can attribute 
most of the latter change to learning, and we can speak of the suc- 
cessful techniques as being reinforced. Finally, the person observes 
how well this activity of chopping is obtaining his goal, whether 
it is to earn money, to strengthen his shoulders, or to beautify his 
yard. If he notes a large discrepancy between his achievement 
and his goal, he may give up chopping altogether. This response 
would still be an example of feedback. It would also involve 
learning, but a different sort from the learning of skill with the 
axe. 

So far, cybernetics as applied to learning is just another descrip- 
tive language. The starting point for a feedback analysis is a state 
of affairs that the system seeks to maintain. In many cases this state 
would relate closely to the concept of drive, since drive involves 
some disruption of the normal state either by the removal of some- 
thing needed or by the presentation of something noxious. (The 
reader will recognize this as an oversimplified and somewhat con- 
troversial definition of drive.) The principal prediction which feed- 
back theory makes is that the system will compensate for any de- 
viations that occur, so as to maintain this state of affairs. The way 
in which it does so is a topic to be studied, not something about 
which the theory already makes predictions. Discrimination among 
cues, reinforcement of responses that reduce drive, the approach- 
ing of incentives with positive valence, and the following of paths 
in the life space can all be treated within this framework. Perhaps 
the most valuable aspect of this approach is that it emphasizes 
the continuous control of sequences of behavior, thus combining 
the fine detail of a molecular analysis with the flexible, purposive 
view of a molar analysis. To what extent the approach will prove 
useful in dealing specifically with questions of learning still largely 
remains to be seen. 


SOME 
SPECIAL TOPICS 


WE HAVE NOW CONSIDERED a number of learning theories, with 
special emphasis on the distinction between connectionist and 
cognitive interpretations and on the ways that connectionist theories 
have been modified to take account of cognitive processes. There 
remain, however, certain problem areas in learning theory that 
deserve further attention. Certain questions, particularly in the area 
of motivation, have received a lot of attention from theorists who 
were neither major system builders nor bridge builders between 
connectionist and cognitive theory. Though there is no common 
theme in the work of all these theorists, all have made important 
contributions to the analysis of learning. 


INTERPRETATIONS OF REINFORCEMENT AND DRIVE 


One topic on which there have been important recent de- 
velopments is the nature of reinforcement. No theorist questions 
that the effects of our actions influence whether we will repeat 
these actions (which is the basic principle of reinforcement), 
but they disagree widely on how this influence should be in- 
terpreted. What, if anything, do all reinforcers have in common 
that makes them reinforcing? Guthrie, though he avoids the term 
“reinforcement,” says in effect that all reinforcement involves is 
stimulus change. Skinner and Estes for the most part ignore the 
question, saying simply that whatever reinforces behavior is a re- 
inforcer. Hull, Miller, Mowrer (as regards solution learning, even 
when it is reinterpreted as another kind of sign learning), Lewin, 
and Tolman all interpret reinforcement as some kind of drive re- 
duction, even though some of them do not speak either of drive or 
of reinforcement. Miller’s drive, cue, response, and reward can be 
roughly translated into Lewin’s terminology as tension, path, ap- 
proach vector, and valence, or into Tolman’s as demand, sign, ex- 
pectancy, and goal object. While these may not be the neatest of 
translations, they illustrate that the drive-reinforcement interpre- 
tation, in whatever words, is a widely recognized one. 


New Descriptions of Drives 


There are currently three trends in the interpretation of reinforce- 
ment, two that work within this drive-reduction framework and 
one that does not. Within the drive-reduction framework, there 
has been a concentration on what drives are innate. In the past, con- 
nectionist psychologists who used the concept of drive have 
tended to treat as primary, or innate, only a few drives, such 
as hunger, thirst, sex, and pain, that are clearly physiological. Cog- 
nitive psychologists, on the other hand, have been more likely to 
discuss such motivational tendencies as pride, ambition, or affec- 
tion, which have no apparent physiological basis. These latter mo- 
tivational tendencies are regarded by connectionist theorists as the 
result of complex learning processes, including the acquisition of 
secondary drives. There is not necessarily any disagreement be- 
tween these two points of view, since the cognitive theorists (other 
than Tolman) have not been much interested in the origins of the 
tensions or demands they discuss. The chief importance of these 
interpretations is that the connectionist theorists have had to make 
assumptions about the learning of motives that are extremely dif- 
ficult to support by experiments. Although Miller was quite suc- 
cessful in demonstrating a secondary drive of fear, other secondary 
drives that are predicted by theory have proved impossible to dem- 
onstrate in the laboratory. As a result, there has been increasing 
interest in the possibility that there may be far more primary 
drives than have previously been recognized. 

One such drive (or perhaps category of drives) that has re- 
ceived much attention is that which is satisfied by new experience. 
It is known variously as a drive of curiosity or exploration or ma- 
nipulation or novelty seeking. Its existence is supported by many 
lines of evidence. Harlow (whom we have met before) showed that 
monkeys would work to unfasten a clasp device over and over 
again, even though neither food, escape, nor any other obvious 
reward was given for doing so. Butler showed that monkeys in a 
closed cage would press a lever to open a window so they could 
look out. If the monkeys were kept in the closed cage for several 
hours before the lever was made available, they would press more 
(indicating higher drive following deprivation). If the lever no 
longer opened the window, their rate of pressing would drop (indi- 
cating extinction). Such novelty-seeking behavior is found not 
only in monkeys, which have long been famous for their curiosity, 
but also in rats. Montgomery and Welker have made extensive 
studies of rats exploring new places and of the responses they will 
learn in order to be able to do such exploring. These and other re- 
lated studies are discussed by Berlyne (1960) and by Welker 
(1961). 

A second kind of drive that has recently been investigated is 
satisfied by activity. This is different from the kind of drive we have 
just discussed in that the activity does not have to produce any 
novel stimulation. Such activity occurs when a rat runs in an ac- 
tivity wheel, which is a sort of voluntary treadmill. A rat in such 
a wheel can, if he chooses, get exercise by running, but no explor- 
ation is involved, since the wheel just goes around without get- 
ting anywhere. Kagan and Berkun (1954) have shown that a 
chance to run in such a wheel will reinforce lever pressing, and Hill 
(1956) has shown that rats do more running in the wheel when 
they have been deprived of activity for a long time. 

A third kind of drive has been referred to as contact hunger, 
first described, like so many interesting phenomena, by Harry 
Harlow. This is a drive which is satisfied by certain kinds of physi- 
cal contact. Harlow originally studied it (Harlow, 1958) by rear- 
ing baby monkeys with surrogate mothers, wire models that gave 
milk like real mothers. He found that monkeys not only preferred 
a “mother” covered with terry cloth to one made of bare wire, but 
ran to the terry cloth “mother” when frightened. They did so even 
if the wire “mother” supplied milk and the cloth “mother” did 
not. These and other findings in his experiments suggested to Har- 
low that contact hunger, exemplified by the baby monkeys’ “hun- 
ger” for the cloth surrogate mother, is an important factor in 
personality development. It may well play an important part, not
only in the attachment of infants to
their mothers, but also in other aspects of love, in sex behavior, and 
in the development of social relationships. 

The importance of these various drives is not simply in the rec- 
ognition that people and animals seek new experience, activity, and 
pleasant sensations of touch. All of these tendencies have long 
been known. Rather, their importance is in the recognition, sup- 
ported by experimental evidence, that they may well be just as 
much a part of our innate, biological nature as hunger, sex, or 
pain. This recognition does not by any means answer all our 
questions about why men seek companionship, achievement, or 
glory. (Some of the unanswered questions are discussed in Hill, 
1968.) Nevertheless, it helps to bridge the gap between the drives
usually employed in laboratory studies of learning and the motives
we all recognize in ourselves and others. 


Optimal Arousal 


The second trend involves a modification of drive theory—the 
theory of optimal arousal. Different versions of this theory have 
been presented by several writers; one good presentation is in 
Fiske and Maddi (1961). This is the view that reinforcement con- 
sists not necessarily of a decrease in drive but rather of a change 
in drive toward some optimal level. The term arousal is preferred 
to drive in discussing this interpretation, so as to avoid confusion 
with more conventional drive theory. If the present level of arousal 
is higher than the optimal level, a decrease will be reinforcing, as 
in the ordinary drive interpretation. However, if the present level 
is below the optimum, an increase rather than a decrease will be 
reinforcing. When the level of arousal is exactly at the optimum, 
reinforcement is presumably not possible for the moment. How- 
ever, both the level and the optimum are constantly changing, so 
any such limitation is quite temporary. 
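The rule just stated can be put as a small Python sketch. It is only an illustration: the numerical arousal scale, the function names, and the treatment of a change that overshoots the optimum as not reinforcing are assumptions made here, not part of the theory as its authors present it.

```python
def reinforcing_direction(current, optimal):
    """Direction of arousal change that should be reinforcing right now."""
    if current > optimal:
        return "decrease"   # the ordinary drive-reduction case
    if current < optimal:
        return "increase"   # e.g. boredom: more stimulation is sought
    return "none"           # exactly at the optimum, for the moment


def is_reinforcing(current, optimal, change):
    """True if the change leaves arousal closer to the optimum (assumed rule)."""
    return abs(current + change - optimal) < abs(current - optimal)


# Hypothetical values on an arbitrary 0-10 arousal scale.
print(reinforcing_direction(current=8, optimal=5))
print(is_reinforcing(current=2, optimal=5, change=2))
print(is_reinforcing(current=8, optimal=5, change=1))
```

Because both the present level and the optimum keep changing, a fuller model would have to recompute these values from moment to moment; the sketch treats a single instant.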

Optimal arousal theory allows for the often observed fact that 
people frequently seek more stimulation rather than less. Con- 
ventional drive interpretations, such as Miller’s, have been un- 
comfortable about such behaviors as riding on roller coasters, 
reading exciting stories, or just complaining of boredom and ex- 
pressing a wish that something would happen to break the monotony. 
Since optimal-arousal theory recognizes that increases in arousal can
be reinforcing as well as decreases, it is less troubled by such occur- 
rences. However, the recognition that either kind of change in 
arousal, up or down, can sometimes be reinforcing carries with it a 
responsibility to indicate when one will be reinforcing and when 
the other. This responsibility in turn requires knowing a lot about 
the determinants both of the present level of arousal and of the 
optimal level. 

One additional assumption (or postulate, in Hull’s terms) about 
the relation between stimulation and arousal represents a start 
toward making the theory more specific. This assumption states 
that the level of arousal is an increasing function of the intensity, 
the complexity, and the novelty of the stimulation. More intense 
stimuli, and more novel ones, are more arousing than those that 
are weaker or more familiar. This assumption, along with the one 
we have already mentioned—that moderate levels of arousal are 
more reinforcing than either high or low levels—forms the basis of 
optimal-arousal theory. 

Several predictions follow from these two assumptions. If novelty 
and complexity remain constant, moderate stimuli should be re- 
inforcing as compared with either weak or intense stimuli. If inten- 
sity and complexity remain constant, stimuli that are somewhat 
different from those we are used to should be preferred both to 
thoroughly familiar ones (which would be judged uninteresting) 
and to highly unfamiliar ones (which would be judged incompre- 
hensible). If intensity and novelty remain constant, stimuli that 
are moderately complex should prove more reinforcing than 
those that are either dully simple or overwhelmingly complex. 
Finally, if more than one aspect of the stimulus varies, then the
higher the stimulus is on any one of intensity, novelty, or
complexity, the lower should be the optimal value of the other
variables. Thus,
a highly novel stimulus should be more reinforcing if it is mild and 
simple, whereas among very familiar stimuli the more intense and 
complex ones should tend to be preferred. 
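These predictions can be checked against a toy model in Python. The theory says only that arousal is an increasing function of intensity, novelty, and complexity; the simple sum used below, the 0-to-1 scales, and the optimum of 1.5 are assumptions chosen for illustration.

```python
OPTIMUM = 1.5  # hypothetical optimal level of arousal


def arousal(intensity, novelty, complexity):
    """Assumed form: arousal rises with each of the three properties."""
    return intensity + novelty + complexity


def preference(intensity, novelty, complexity, optimum=OPTIMUM):
    """Higher (closer to zero) means nearer the optimum, hence preferred."""
    return -abs(arousal(intensity, novelty, complexity) - optimum)


# Novelty and complexity held constant: moderate intensity is preferred.
weak = preference(0.0, 0.5, 0.5)
moderate = preference(0.5, 0.5, 0.5)
intense = preference(1.0, 0.5, 0.5)
print(moderate > weak and moderate > intense)

# A highly novel stimulus is preferred when mild and simple ...
print(preference(0.25, 1.0, 0.25) > preference(1.0, 1.0, 1.0))
# ... whereas a very familiar one is preferred when intense and complex.
print(preference(0.75, 0.0, 0.75) > preference(0.25, 0.0, 0.25))
```

Any other increasing function would give the same qualitative pattern, so long as there is a single optimum between the extremes.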

This assumption that arousal is an increasing function of in- 
tensity, novelty, and complexity fits in well with a concept that 
has been quite popular among theorists in the Soviet Union—the 
orienting reflex. Although the logic of the term is that individuals 
tend to orient themselves in varying degrees toward stimuli as they 
appear, what is actually measured in the orienting reflex is
primarily the general arousal produced by a stimulus. The greater the
galvanic skin response, the change in heart rate, or the increase 
in muscle tension produced by a stimulus, the greater the orienting 
reflex to that stimulus is said to be. Measures of
central-nervous-system activity, such as the electroencephalograph
(EEG), also give
indications of the strength of the orienting reflex. The orienting 
reflex is greater for more intense stimuli than for milder ones and 
shows some tendency to be greater for more complex stimuli than 
for simpler ones. If the same stimulus is presented repeatedly (thus 
making it more familiar), the orienting reflex progressively
decreases. However, it can be restored a good part of the way to its
original strength in either of two ways: by changing the stimulus in
some way or by just waiting for a while. Either procedure has the
effect of making the stimulus more novel. Thus the orienting reflex
serves as evidence that the level of arousal is affected by various
aspects of stimuli in the way that optimal-arousal theory says it
should be.

The reader will have noted that optimal-arousal theory attempts 
to deal with some of the same phenomena that drive theory has 
tried to deal with by postulating such drives as exploration and 
activity. To say that there are drives which can be reduced by 
exploring new things or by becoming more active is to say that 
sometimes an increase in stimulation produces a decrease in drive. 
Thus both drive theory and optimal-arousal theory are getting 
away from the older version of drive theory—particularly closely 
associated with Miller—that what is reinforcing is a reduction in 
the level of stimulation. Drive theory makes the shift by looking 
at each drive separately without worrying about whether it conforms
to any general notion about drives as states of high stimulation.
Optimal-arousal theory, on the other hand, keeps the idea of an
over-all level of arousal that is closely related to reinforcement,
but broadens its interpretation of just what the relationship to
reinforcement is.


Responses as Reinforcers 

The third trend in 
what he does. This wording sounds like something Guthrie might
say, though actually the idea is more like Spence’s. The extent to 
which r_G is conditioned depends, in Spence’s system, on the
amount of goal activity that takes place. A lot of eating in a given
place results in strong conditioning of r_G to that place. How
reinforcing it is to get to a given place depends, therefore, on how
much eating (or other goal activity) takes place there. 

Two men besides Spence have advanced this idea. One, Fred 
Sheffield, drew his inspiration from Guthrie. Sheffield maintained 
that the act of consuming a goal object, rather than any form of 
drive reduction, is the essential factor in reinforcement. Will a re- 
sponse that leads to some goal object be reinforced by the goal ob- 
ject and learned? This depends, says Sheffield, on whether the goal 
object is consumed. Food, for example, is reinforcing to the extent 
that we eat it. Our own activity of eating provides the reinforce- 
ment for whatever response brings us to the food. 

Sheffield did several experiments to support this position. For
the most part, however, they were inconclusive. He showed, for
example, that drinking a saccharin solution is reinforcing for
rats, even though saccharin is completely non-nutritive (Sheffield
and Roby, 1950). This finding certainly argues against the view
that all reinforcers involve reduction in a body need, since
saccharin does nothing to reduce the body’s need for food. The study
does not show, however, that the drinking activity was what
produced the reinforcement. One could just as well say that the sweet
taste of saccharin is the crucial reinforcing factor.

A more general formulation of this point of view has been 
presented by David Premack. He simply states that of any two 
responses, the one that occurs oftener when both are avail- 
able can reinforce the one that occurs less often, but not vice versa. 
This relationship is illustrated by an experiment with children 
(Premack, 1959). The children were given an opportunity to 
engage in two activities, eating candy and playing a pinball ma- 
chine. Some did one of these two oftener, some the other. Pre- 
mack then arranged the apparatus so that for half the children 
the candy dispenser would work only if the child first operated the 
pinball machine. To the children who preferred playing, the new
arrangement made little difference; they just went on happily
playing the machine. For children who preferred eating to playing,
however, this change did make a difference—their rate of play- 
ing went up, indicating that candy was reinforcing the playing. So 
far this is a familiar relationship—food reinforcing an instrumen- 
tal activity. For the other half of the children, the arrangement was 
reversed, so that the pinball machine would work only if the child 
first took candy from the dispenser. To the children who preferred 
candy, this change made no difference. The children who preferred 
playing, however, ate more candy under this arrangement than be- 
fore. In this case, playing reinforced eating. 


In this experiment, for any one child only one of the two 
activities could function as a reinforcer. This is not, however, a 
necessary restriction. For example, Premack (1962) has also 
studied the reinforcement relationships between running and
drinking in rats. If a rat has spent a day without water, but with
continuous access to an activity wheel, he will run for the re- 
inforcement of a chance to drink, as we would expect. If, instead, 
the wheel is locked so that he cannot run unless he first drinks, 
this relationship will have no effect on how much he drinks— 
running will not reinforce drinking. 

However, the situation can be reversed by having the same rat 
live for a day with water always available but with no chance to 
run. Now the tendency to run will be high and to drink will be 
low. Consequently, running can be used to reinforce drinking—if 
the rat has to drink in order to unlock the wheel and run, his 
drinking will increase. On the other hand, requiring the rat to run 
in order to be able to drink will now have no effect on the amount 
of running. These somewhat complicated relationships are
diagrammed in Fig. 8.

Thus the same animal can be either in a state where drinking
reinforces running or a state where running reinforces drinking,
and in each case the reverse does not apply. All that is necessary
to change him from one state to the other is to do something (in
this case, deprive him of drinking or of running) that will change
his relative tendency to engage in these two activities.

                              I                    II
CHARACTERISTICS               DEPRIVED OF          DEPRIVED OF
                              DRINKING             RUNNING

TENDENCY TO DRINK             HIGH                 LOW
TENDENCY TO RUN               LOW                  HIGH

IF MUST RUN TO BE ABLE        INCREASES HIS        NO CHANGE IN
TO DRINK                      RUNNING              RUNNING OR
                                                   DRINKING

IF MUST DRINK TO BE           NO CHANGE IN         INCREASES
ABLE TO RUN                   RUNNING OR           HIS DRINKING
                              DRINKING

DOES DRINKING                 YES                  NO
REINFORCE RUNNING?

DOES RUNNING                  NO                   YES
REINFORCE DRINKING?

Fig. 8. The reversibility of reinforcement. Columns I and II give the
characteristics of two states that a rat can be in with regard to the
relative reinforcing effects of drinking and running.

In a conventional schoolroom, where one of the most difficult
demands on pupils is to sit still for long periods of time, we would
expect from Premack’s analysis that it would often be possible to
use access to the playground as a reinforcer for a certain amount
of quiet work, but rarely possible to do the reverse. However, in 
a strenuous program of physical education, it might be possible 
to reinforce a student’s putting forth his last ounce of athletic 
effort by rewarding him with the chance to sit quietly and read a 
book. This latter arrangement would of course be expected to work 
better for the less athletic and more bookish students. Both indi- 
vidual differences and situational differences determine what activi- 
ties can reinforce what other activities for a given individual at a 
given time. 

In Premack’s interpretation, then, there is no special class of 
consummatory responses that act as reinforcers. Any kind of re- 
sponse can reinforce any other kind of response, says Premack. 
All that is necessary is for the reinforcing response to be one that 
the learner makes more frequently than the response being re- 
inforced. 

Premack’s interpretation has the same drawback as Sheffield’s
—that we do not know whether it is the activity itself or the con- 
sequences of the activity that acts as the real reinforcer. However, 
for many purposes this difference is unimportant. If we know 
how often an individual chooses to do any two things, we can pre- 
dict (according to Premack) whether or not one of them will re- 
inforce the other. To the extent that this relationship holds, it is 
a powerful predictive tool, regardless of how the relationship may 
finally be explained. 
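Premack's rule is simple enough to state as a few lines of Python. The sketch below assumes that we have, for each activity, some measure of how often it is chosen when everything is freely available; the function name and the particular rates are hypothetical, chosen only to mimic the two states of the rat in the running and drinking experiment.

```python
def premack_predictions(baseline):
    """Map each (reinforcer, target) pair to True or False under Premack's rule.

    baseline: dict of activity name -> how often the activity is chosen
    when all activities are freely available (any consistent unit).
    """
    predictions = {}
    for reinforcer, reinforcer_rate in baseline.items():
        for target, target_rate in baseline.items():
            if reinforcer != target:
                # The more frequent response can reinforce the less frequent
                # one, but not vice versa.
                predictions[(reinforcer, target)] = reinforcer_rate > target_rate
    return predictions


# Hypothetical relative tendencies in the two deprivation states.
water_deprived = {"drinking": 0.8, "running": 0.2}
activity_deprived = {"drinking": 0.2, "running": 0.8}

for label, rates in [("deprived of drinking", water_deprived),
                     ("deprived of running", activity_deprived)]:
    for (reinforcer, target), works in premack_predictions(rates).items():
        print(f"{label}: {reinforcer} reinforces {target}? {works}")
```

Nothing in the sketch says why the rule holds; like Premack's formulation itself, it predicts from observed frequencies alone.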


FRUSTRATION THEORY 


In recent years, one important advance in the study of motiva- 
tion has been the elaboration of the Hull-Spence tradition to in- 
clude the analysis of frustration. This line of theorizing has 
received its greatest impetus from the work of Abram Amsel (b. 
1922). In some ways Amsel’s relation to Spence has resembled 
Spence’s to Hull, in that Amsel was Spence’s student and Spence 
incorporated many of Amsel’s ideas into his own thinking. Since 
Spence’s death there has been no one person who can clearly be 
considered the chief exemplar of the Hullian tradition, but Amsel 
probably comes the closest. Much of his work is summarized in 
Amsel (1962). 

What does frustration theory say? It begins with the concept 
of r_G, which we have already noted can easily be translated into
cognitive terms as an expectancy of reward. When an organism
expects reward in a certain place (i.e., makes r_G to the stimuli of
that place) and then does not find reward, frustration results. 
Frustration is conceived to be a drive, produced by the non- 
fulfillment of an expectancy of reward just as other drives are 
produced by food deprivation or by injury to the body. Though 
the detailed reasoning is not always easy to follow, all the effects 
that frustration is assumed to have in various situations are aspects 
of its status as a drive. 


Some Effects of Frustration 


What are some of the properties of a drive? For one thing, it is 
a state of high arousal, multiplying habit strength to increase excita- 
tory potential. If frustration is a drive, it should be possible to 
show that subjects will respond more vigorously immediately after
being denied an expected reward than at other times. That subjects 
will respond thus has been shown by having a rat run down two 
alleys in succession. He is always rewarded at the end of the second 
alley, but only part of the time at the end of the first. Once he 
learns to expect (or at least hope for) reward at the end of the 
first alley, he should be frustrated on those trials when he does not 
in fact find reward there. This frustration drive should increase his 
total drive level, thereby producing a higher excitatory potential for 
running down the second alley than he would have if he were not 
frustrated. Thus rats should run down the second alley faster on 
trials when they are nonrewarded in the first alley than on trials 
when they are rewarded, a prediction which has repeatedly been 
confirmed. 

Another characteristic of a drive is that it functions as a stimu- 
lus. Just as a subject can learn to make one response when hungry 
and a different response when satiated, because hunger and satiety 
produce different internal stimuli, so he should be able to learn to 
make one response when frustrated and a different response when 
not frustrated. He should, for example, be able to learn to turn 
right in a T maze if there is food at the choice point and left if he 
is frustrated by finding no food at the choice point. This prediction, 
too, has been tested and confirmed. 

A third characteristic of a drive, as Miller emphasized, is that 
it can be conditioned to new stimuli, which a subject will then 
learn a new response to escape. In the learning of a discrimination,
the negative stimulus (SΔ in Skinner’s terms) is a signal for
frustration and so should become a secondary drive stimulus. It should
then be possible to place the subject in a new situation where he
has never experienced either reward or frustration, turn on the
signal, and teach him a new response to turn the signal off. Wagner 
(1963) and Daly (1969) have shown that rats will indeed learn 
to jump across a hurdle from one side of a box to the other in 
order to turn off a stimulus which has previously been a signal for 
frustration. 

The Partial-Reinforcement Effect 


Finally, just as subjects can learn to expect reward, they should 
also be able to learn to expect frustration. Just as r_G (or r_R, R
for reward, in Amsel’s terms) comes to occur as a subject approaches
a place where he has previously been rewarded, so r_F (F for
frustration) should also occur as he approaches a place where he has
been frustrated. This anticipation of (possible) frustration is as- 
sumed to produce competing responses, responses that interfere 
with continuation of the goal-directed behavior. It therefore takes 
longer to complete the response. Spence included a reference to 
this effect as a minor part of his system in the 1956 book. He 
suggested that E = H(D + K) is a complete formula only in cases
of continuous reinforcement. Where nonreinforcement occurs, it is
necessary to expand it to E = H(D + K) - I, with I referring to the
interference from competing responses produced by frustration.
The I would be present in extinction, in discrimination learning 
(for the nonreinforced alternative), and in partial reinforcement. 
Using I, Spence was able to derive various phenomena that could 
not have been explained using only H, D, and K. 
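Spence's expanded formula can be tried out numerically. The Python sketch below uses made-up values for H, D, K, and I, and it assumes that frustration drive simply adds to D; it is meant only to show how the same formula yields faster running after frustration and slower running when frustration is anticipated.

```python
def excitatory_potential(H, D, K, I=0.0):
    """Spence's E = H(D + K) - I, with I defaulting to zero."""
    return H * (D + K) - I


# Hypothetical values in arbitrary units.
H, D, K = 0.5, 2.0, 1.0

baseline = excitatory_potential(H, D, K)

# Just after nonreward: frustration is assumed to add to total drive.
frustration_drive = 1.0
after_frustration = excitatory_potential(H, D + frustration_drive, K)

# Approaching a place of possible nonreward: competing responses interfere.
anticipating_frustration = excitatory_potential(H, D, K, I=0.5)

print(baseline, after_frustration, anticipating_frustration)
```

With these values the excitatory potential rises above the baseline in the first case and falls below it in the second.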

It should be noted that I, although it bears some resemblance
to Hull’s I_R and sI_R, is a quite different concept. Whereas Hull’s
terms refer to inhibitions against responding, Spence’s I refers to
the active interference of one set of responses (those elicited by
frustration) with another set (those getting the subject to the
goal). It applies only in cases of nonreward, while I_R and sI_R
were assumed to be involved to some degree in all responding.

It may seem strange to the reader that frustration should tend 
to make animals run faster in the two-stage runway, but yet that 
anticipatory frustration should make them run slower. It is true 
that theorists attribute so many different properties to frustration 
that there is a constant problem of maintaining consistency, one 
with which we can sympathize after studying Hull’s system.
Nevertheless, it is not difficult to see intuitively how both these opposite
effects might occur. The individual who is afraid he is going to be 
frustrated if he completes some course of action may hesitate and 
dawdle, easily distracted from a goal toward which he is ambi- 
valent. On the other hand, if he reaches the goal and is indeed 
frustrated, this frustration may well energize him to do something 
else, something from which he definitely expects reward, with 
extra vigor. Perhaps the unsuccessful (hence often frustrated) 
student who dawdles on his way to school but emerges from school
onto the playground with a burst of energy bears some resemblance
to the rat in the two-stage runway! 

The assumed occurrence of r_F has also been used by Amsel to
explain a phenomenon that has often troubled theorists, the 
partial-reinforcement effect. We mentioned in connection with 
schedules of reinforcement in Chapter 3 that it takes longer to 
extinguish a response following intermittent reinforcement than 
following continuous reinforcement. This generalization is true not 
only of the free-operant situation that we were considering there 
but also of the case where animals receive separate trials in a 
runway. In such situations it is customary, for some obscure his- 
torical reason, to speak of partial reinforcement rather than inter- 
mittent reinforcement. In partial reinforcement in a runway, for 
example, there is a reward at the end of the runway on some trials 
but not on others. Greater resistance to extinction following partial 
reinforcement than following continuous reinforcement is known 
as the partial-reinforcement effect, and it is this effect that Amsel 
attempts to explain by using the concept of r_F.


When a rat receives partial reinforcement in a runway, he
presently begins making r_F. At first, as we have noted, this
fractional anticipatory frustration tends to disrupt his running.
Nevertheless, on each trial he eventually gets to the goal and is
reinforced for his running. Running in the presence of r_F (or,
strictly speaking, in the presence of s_F, the stimuli produced by
r_F) is reinforced. Eventually, therefore, the s_F become no longer
stimuli for competing responses, but rather stimuli for running.

What happens when extinction begins? Rats that have received
continuous reinforcement in the runway now begin for the first time
to make r_F, which give rise to competing responses (Spence’s I).
Since extinction involves for them both a decrease in K and an
increase in I, the rats extinguish the running response rapidly. For
the partially reinforced animals, however, r_F are nothing new and
now tend to produce running rather than competing responses. For
these animals there is thus only one factor tending to produce 
extinction—the decrease in K—while the additional factor of I 
is not involved. The extinction for these animals therefore proceeds 
more slowly, giving the partial-reinforcement effect. 
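The argument can be followed step by step in a toy simulation. The Python sketch below is not Amsel's model; it merely assumes that K shrinks by a fixed proportion on every nonrewarded trial, that I grows by a fixed step only for animals trained with continuous reinforcement, and that running stops once E = H(D + K) - I falls below an arbitrary threshold. All the numbers are hypothetical.

```python
def trials_to_extinction(interference_builds, H=1.0, D=1.0, K=1.0,
                         k_decay=0.9, i_step=0.1, threshold=1.2,
                         max_trials=1000):
    """Count extinction trials until E = H(D + K) - I drops below threshold."""
    I = 0.0
    for trial in range(1, max_trials + 1):
        K *= k_decay                 # incentive fades without reward
        if interference_builds:
            I += i_step              # r_F evokes competing responses
        E = H * (D + K) - I
        if E < threshold:
            return trial
    return max_trials


# Continuous training: both factors operate during extinction.
continuous = trials_to_extinction(interference_builds=True)
# Partial training: s_F already cue running, so only K declines.
partial = trials_to_extinction(interference_builds=False)

print(continuous, partial)
print(partial > continuous)
```

Whatever particular values are chosen, the group with two factors working against it cannot extinguish more slowly than the group with one, which is the partial-reinforcement effect in miniature.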

As with the Hullian theory from which it comes, frustration 
theory is complex and constantly struggling to avoid getting so
enmeshed in its various assumptions that it becomes self-contra- 
dictory. Nevertheless, its success in applying the rigor of Hullian 
theory to as clinically useful a concept as frustration has made it 
an important feature on the current theoretical scene. 


VERBAL LEARNING AND MEMORY 


The theories we have discussed so far in this book have for the 
most part not had much to say about memory for verbal materials. 
This silence may well have seemed strange to the reader, since 
remembering names, dates, lists of crucial facts, and innumerable 
other verbal materials plays such an important part both in school 
and in the world at large. 

Learning theorists have never entirely ignored verbal learning— 
Hull, for example, wrote an impressively elaborate deductive 
analysis of the process of serial memorization. For the most part, 
however, learning theorists have regarded verbal learning as too 
complex for use in theory building, and have preferred to seek basic 
laws of learning at the level of conditioning and animal learning, 
leaving for later the job of applying these laws to human memory. 
This strategy has left it for other people to study verbal learning 
and memory as topics in their own right. An outstanding example 
of a person who concentrated on verbal learning and memory, but 
without ignoring the larger field of learning theory, is Benton J. 
Underwood (b. 1915) of Northwestern University. 


Underwood should certainly be classified as a connectionist 
psychologist, for most of his analyses are in terms of verbal re- 
sponses and the stimuli, usually also verbal, that elicit them. Many 
of his experiments are simply concerned with finding the relation- 
ships between independent and dependent variables, with little in- 
terest in theoretical interpretations. In this respect he resembles 
Skinner. When he does act as a theorist, however, his interpretations 
are relatively formal: though he does not put them in the Hullian 
form of postulates and theorems, he does make them specific enough 
to be clearly testable with experiments. He uses the term
reinforcement, but without much concern as to what the nature of
reinforcement is; in this respect he is rather like Estes. A cautious,
step-by-step approach, with great emphasis on proper research
methods, characterizes all of his work.


Causes of Forgetting


One of Underwood’s many interests in verbal learning has been 
the question of why we forget. Once something has been memo- 
rized, why do we not remember it forever? Of the various explana- 
tions that have been offered, by far the most influential has been the 
interference interpretation, that we forget because our retention 
of any given item is interfered with by all the other items we have 
learned that resemble it in some way. Though we have mentioned 
only Guthrie as having such an interpretation, actually a number 
of the theorists we have discussed in this book agree that forgetting 
is caused primarily, if not exclusively, by such interference. 

Underwood agrees on the importance of interference in for- 
getting, but raises a question as to which of two sources of inter- 
ference is the more important. If we forget any given thing 
because of interference from something else that we learned after- 
ward, this effect is known as retroactive inhibition. The second 
thing we learned has in some sense replaced the first in memory, 
so that it is harder to retrieve the first. Traditionally retroactive 
inhibition (not to be confused with Hull’s reactive inhibition) has 
been regarded as the major form of interference (again, think of 
Guthrie’s explanation of forgetting). However, it is also possible 
that the interference with any given material could come from 
other material that had been learned before. This effect is known 
as proactive inhibition. These two possible forms of interference 
are diagrammed in Fig. 9. 

At first glance proactive inhibition seems like a less plausible 
cause of forgetting than retroactive inhibition. Once the new 
material has been learned, and the old material therefore supposedly 
unlearned, or at least pushed into the background, why should it 
interfere with retention of the new? Underwood points out that 
what happens to the first material when the second is being learned 
is comparable to extinction. But what happens to an extinguished 
response with the passage of time? Very often it shows spontaneous 
recovery. Why should we not expect that the material which was 
learned first would remain only temporarily extinguished, and then 
return through spontaneous recovery to interfere with the material 
learned second? If this process does indeed take place, the passage 
of time should serve not only to increase the effect of proactive
inhibition but also to decrease the effect of retroactive inhibition
through the recovery of the material to be remembered. Underwood
(1948) found that this expectation is indeed what occurs.
With the amount and difficulty of the interfering materials held
constant, retroactive inhibition produced much more interference
with retention five hours after learning than did proactive, but after
a two-day interval there was practically no difference between them.

SEQUENCE   LEARN FIRST         LEARN SECOND        TEST ON

A          KEY LIST            INTERFERING LIST    KEY LIST
B          INTERFERING LIST    KEY LIST            KEY LIST
C          KEY LIST            NOTHING             KEY LIST

COMPARISONS OF KEY-LIST SCORES ON THE TEST:

C - A = RETROACTIVE INHIBITION
C - B = PROACTIVE INHIBITION
B - A = EXTENT TO WHICH RETROACTIVE INHIBITION IS MORE
        EFFECTIVE THAN PROACTIVE

Fig. 9. Retroactive and Proactive Inhibition. By comparing retention
of one list (here called the key list) by groups with different experience,
we can measure the amounts of retroactive and proactive inhibition.
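The comparisons listed in Fig. 9 amount to three subtractions, which can be written out in Python. The scores below are invented for illustration (say, items recalled from the key list); only the three difference formulas come from the figure.

```python
def interference_measures(score_a, score_b, score_c):
    """Apply the Fig. 9 comparisons to key-list test scores.

    score_a: key list learned first, interfering list second
    score_b: interfering list learned first, key list second
    score_c: key list learned with no interfering list
    """
    return {
        "retroactive": score_c - score_a,
        "proactive": score_c - score_b,
        "retroactive_minus_proactive": score_b - score_a,
    }


# Hypothetical mean recall scores for the three sequences.
measures = interference_measures(score_a=4, score_b=6, score_c=9)
print(measures)
```

The third measure is simply the first minus the second, so a value near zero corresponds to the finding at the two-day interval, where the two kinds of inhibition were about equally effective.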


Which Cause Is Most Important?


This experiment of Underwood’s demonstrates that proactive 
inhibition is just as plausible an explanation of forgetting as is 
retroactive. It does not, however, give us much basis for deciding 
how much each of these processes actually contributes to everyday 
forgetting. Underwood’s impression about the relative importance 
of these two processes came from another line of evidence. He 
noted that when college students memorized a list of words or 
nonsense syllables, the amount that they remembered from the 
list a day later depended greatly on how much prior experience 
they had had in memorization. We might expect that the more
experienced memorizers would show an advantage, and indeed they 
do if we look at the rate of original learning. However, if both 
experienced and inexperienced memorizers keep on working until 
they have mastered the list, and then stop, it is the inexperienced 
memorizers who retain more after an interval of time (Underwood,
1957). This difference does not seem so surprising when we con- 
sider that all the previous lists learned by the experienced group 
were building up proactive inhibition that interfered with retention 
of the final list. If there are many previous lists to produce proactive 
inhibition, this effect can be huge. In one experiment (Keppel, 
Postman, and Zavortink, 1968), subjects learned a series of paired- 
associate lists at two-day intervals and were tested for retention of 
each before learning the next. On the average they remembered 
70 percent of the items on the first list and only 5 percent of the 
items on the thirty-sixth list, even though the lists were equally 
difficult and were tested the same length of time after being learned. 
Clearly, differences in proactive inhibition alone can make re- 
markable differences in the amount of forgetting. 

Does it follow that proactive inhibition from things we have 
learned previously is the major cause of our ordinary forgetting? 
Not necessarily, since experiments that vary only the proactive
inhibition do not tell us about the effects of the retroactive inhibition
or of other possible causes of forgetting. However, if we are
concerned with forgetting over periods of days or weeks, there is
likely to be much more other material that can interfere by way
of proactive inhibition (given the previous years we have had to 
learn such material) than by way of retroactive (which would have 
to be learned during the days or weeks of the retention interval). It 
thus seems highly plausible that proactive inhibition is the most 
important cause of forgetting. 

Though the above line of argument with its supporting data 
represents a major contribution to our understanding of forgetting, 
Underwood was not satisfied to leave it at that. He went on to 
gather new data, and not all of what he found fits well into his 
explanation of forgetting. Because Underwood’s theory of forgetting 
is fairly simple and quite close to data, it has the sort of self-correct- 
ing character that Hull sought but did not always achieve. Through 
his application of simple learning principles to more complex forms 
of learning and through his constant checking of predictions against 
data, Underwood has provided not only an interpretation of for- 
getting but also an impressive example of careful, small-scale theory 
building. 


LEARNING THEORY 
PRESENT AND FUTURE 


IN ANY SURVEY of an area of knowledge, such as that attempted 
in this book, there is a great danger that the reader will come away 
with a kaleidoscope of impressions—some, hopefully, interesting 
and enlightening—but with no over-all picture of the field. To a 
cognitive theorist, at least, such an outcome would seem most re- 
grettable. In hopes of avoiding such an outcome, let us consider 
what the various interpretations we have examined can contribute 
to our understanding of learning. 


THE HOPE FOR A BETTER THEORY OF LEARNING 


First, as to the place of theory in general. All interpretations 
of learning are to some extent theoretical. No matter how hard one 
tries to “stick to the facts,” he is bound to select certain facts 
at the expense of others and to generalize beyond his evidence, 
and he is very likely to talk about processes that he cannot directly 
observe. Our choice is not between facts and theories but between 
narrow theories and broad ones, and between casual theories and 
formal ones. The current trend in the psychological interpretation 
of learning seems to be both toward narrower and toward more for- 
mal theories. Such a generalization is risky, however, for there are 
many currents in psychological thought, and over-all trends are 
difficult to determine. 

Most psychological interpreters of learning would probably rec- 
ognize as ultimately desirable a broad, formal theory from which 
a wide range of learning laws could be predicted with a high 
degree of precision. There is, however, much skepticism about 
the practicality of such a system in the near future. For the present,
there is room for many approaches to theory. Systems of classification,
collections of very specific laws, broad rough-and-ready
generalizations, and formal deductive theories all have a legiti- 
mate place in a young and growing field such as the psychology 
of learning. 

There are two generalizations that we can make with a good 
deal of confidence. One is that narrow theories have a greater 
chance to be precise than do broader ones: the more situations 
one tries to encompass in a theoretical system, the more likely it is 
that one’s predictions will apply more-or-less to many situations 
but not closely to any. The other generalization is that theories can 
be more precise when the situations they deal with are controlled 
experimental situations than when they are the variable situations 
of everyday life. Theories developed in the laboratory are, of 
course, applied to everyday life, but in this application they lose 
some of the precision that is their greatest asset. 

The two above generalizations help to explain the variety of ap- 
proaches to theory that we find in the study of learning. For a per- 
son who deals with a great variety of people in different situa- 
tions, it may be both more useful and more satisfying to have such 
a broad generalization as, “Learning is most effective when the 
learner is motivated but not threatened.” As a scientific law, this 
generalization is not very meaningful, since it leaves unspecified 
what we mean by “motivated” and by “threatened.” It does, 
however, have a valuable use in telling us what to look for in a 
learning situation. Another person, dealing with one particular 
learning situation, might find it both more useful and more sat- 
isfying to know that “New Yorkers learn to use this machine most 
effectively if they are trained in the morning, shown a technique 
twice before practicing it, and offered a bonus for fast learning.” 
This generalization permits more precise predictions than the pre- 
vious one, and hence can be more easily and accurately used, but its 
range of application is too narrow to give it any general significance. 
Ideally, we would like to be able to combine the precision of the 
second generalization with the breadth of the first. This is a goal 
that keeps many theorists working. Until we can, however, theories 
that give us narrower predictions or vaguer ones will both have
their place in psychology. 

Apart from the general question of what form theories of learning
should take, there are the various controversies about the
terms in which learning should be interpreted. The most conspicu- 
ous is that between connectionist and cognitive interpretations, with 
those about the nature of reinforcement only less conspicuous, 
and a number of lesser ones much less noticed. Earlier in the his- 
tory of modern psychology, these differences in interpretations 
were the basis of vigorous arguments among various theoretical 
schools of psychology. In recent years the differences have become 
less clear-cut and the controversies about them quieter. As new 
data were gathered, all the system builders who wanted to re- 
main in business had to make adjustments in their systems to ac- 
commodate these data. In the process, they acquired more respect 
for the complexities of learning, more humility about their own 
systems, and more respect for the efforts of others. 

This quieting does not mean that theoretical differences are dis- 
appearing—far from it. It does mean, however, that system builders 
are tending to be more tentative in their formulations. Rather than 
looking on theories as true or false statements about the nature of 
the world, they look on them as more or less adequate ways of sum- 
marizing present knowledge and of facilitating the gathering of 
new knowledge. Rather than saying, “I have the correct interpre- 
tation, and yours is wrong,” they say, “Let’s all see how far we can 
go with our theories in explaining the data, and I’m guessing that 
I can go farther than you.” An extreme example is seen in the title 
of an article by Frank Restle: “Run Structure and Probability 
Learning: Disproof of Restle’s Model.”

Along with this change goes an increasing recognition of the 
boundary conditions of theories and an increasing respect for 
specialization. The difference between two theories is more 
likely to be seen as a reflection of different interests than as a dis- 
agreement about fundamental principles. Theorists now differ 
not so much in their biases about the nature of learning as in the 
areas they prefer to study and the methods they prefer to use. The- 
oretical controversy is by no means dead, but it tends to be both 
more muted and more closely related to specific topics than was 
once the case. 


Criteria for an Ideal Theory of Learning


For all this increased modesty among theorists of learning, the 
goal of building a general theory of learning, broad yet precise, re- 
mains as an ultimate aspiration. What would such a theory be like? 
It would probably be formal, very likely stated much like 
Hull’s: in terms of definitions, postulates, and theorems. This 
formality would serve both to make the theory more precise in 
its predictions and to reveal its inadequacies more clearly so that 
they could be corrected. It would be a theory constantly undergoing 
modification in its details with new data, yet so well conceived that 
its basic structure could accommodate a wide range of possible 
findings. Ideally, it would cover the whole range of learning phe- 
nomena in animals and men and the whole range of independent 
variables that affect learning; yet it would cover all this range 
with a small number of postulates. It is easy to see why most
psychologists consider such a theory far in the future!

Such a theory would have to deal with the complexities of human 
symbolic learning and insightful problem solving. It would have 
to allow for the flexibility of behavior, for the way that the same 
knowledge or belief can be used in many different ways under dif- 
ferent conditions. This means that it would have to go beyond any 
existing connectionist theory in dealing with cognitive processes. 
At the same time, it would have to recognize that such cognitive
flexibility is far from universal even in the most intelligent of hu- 
mans. It would have to deal with simple conditioning, with rote 
memorization, with skills that we can display without knowing 
how we do it, with blindly repetitive habits, and with lack of
insight in situations where the opportunity for insight seems clear.
And in addition to dealing with all of these forms of behavior when 
they occur, the theory must also tell under what conditions each of 
them will occur. 

To accomplish such a monumental task [...]
undoubtedly take account of the develo[...]
consider, in much more detail than any [...]
influence of earlier learning on later lear[...]
will learn, how he will learn it, and wha[...]
learning in any given situation, all depend greatly on his prior
learning. Tolman recognized this [...]ral fact in his concept of field
cognition modes, Harlow in his discussion of learning sets, and the
Kendlers in their analysis of mediating responses in reversal and
nonreversal shifts. Hull’s concept of the habit-family hierarchy as- 
sumes the importance of prior learning. Learning to learn is a much 
discussed topic in the field of rote memorization. Donald Hebb of 
McGill University, in a theory primarily concerned with perception 
and motivation (Hebb, 1949), has emphasized the importance of 
early experience on later learning, and this interest has led to much 
research. To predict whether a given individual’s learning in a 
given situation will be flexible or stereotyped, insightful or rote, our 
“theory of the future” will certainly have to give much considera- 
tion to the effects of his prior experiences on his present learning 
processes.

Our ideal theory must also take into account the detailed stim- 
ulus situation. This aspect of learning has been most emphasized 
by Guthrie and by the gestalt psychologists. As might be expected 
from the great differences between these two systems, the forms that 
this emphasis takes in the two are in marked contrast to one another. 
Guthrie is concerned with getting a particular response that has 
occurred once to occur again, and he points out that it is most
likely to occur again when the stimuli on the two occasions are most 
similar. The gestaltists, on the other hand, are interested in getting
an appropriate new response to occur by a process of insight, and 
they point to the importance of having the situation so arranged
that the desired response will form part of a good gestalt. These
effects of the exact stimulus situation, both in the stereotyped use 
of old responses and in the insightful appearance of new ones, must 
be taken into account by any adequate theory.

The two factors we have just discussed—past experience and 
the present stimulus situation—interact through the process of 
paying attention. How the stimulus situation affects the individual
depends on what aspects of it he pays attention to. Attention was 
long a relatively neglected concept in learning theory, but this neglect 
has begun to be repaired. Such concepts as “observing response”
and “acquired distinctiveness of cues” testify to this interest in 
attention. Lewin has carried this line of thought farther by noting 
that the effect of any experience depends on how we interpret it. 
This observation is undoubtedly true; the problem is to take account
of this fact without losing track of the external stimulus and
thus losing track of learning as well. None of the learning theories
we have considered has an adequate answer to this problem, but
our ideal theory would have to include one. 

Finally, an ideal theory of learning would have to deal effec- 
tively with the topics of motivation and reinforcement. We know 
that learning (or at least learned behavior) depends on our needs 
and desires and on the positive and negative incentives around us. 
We also know that the consequences of an act are a major factor 
in whether we repeat that act. All theoretical interpretations of 
learning take account of these relationships in one way or another, 
but many unanswered questions remain. What needs or drives or 
demands are we born with, and how are they modified by learning 
to produce the multiplicity of adult motives? Do rewards act to reinforce
responses, or to give positive valence to incentives, or both?
Such questions as these remain unanswered, not because theories 
have failed to deal with them, but because different theories have 
dealt with them in so many conflicting ways that it is difficult to 
choose the most satisfactory. A truly adequate theory of learning 
would answer these questions in such a way as to sum up all our 
present knowledge about motivation. 


Candidates for an Ideal Theory 


What sort of theory could meet all the requirements we have 
set for it? At present the best guess seems to be that if one comes it 
will be a connectionist theory, but one in which symbolic mediat- 
ing responses play a large part. It will interpret learning as basi- 
cally a matter of stimulus-response connections, but with many of 
the connections being among mediating responses that serve to 
tie together complex patterns of behavior. It will recognize both the 
many innate sources of drive and reinforcement and the ways in 
which secondary drives and reinforcers are acquired. It will rec- 
ognize both individual differences and age differences in learning 
abilities more than any current theory of learning. It will include
a gradual increase in the complexity of mediating responses as a 
result of early learning. It might look somewhat like Spence’s sys- 
tem, expanded and modified by Estes, Harlow, Osgood, and oth- 
ers, and with some features not found in any current learning the- 
ory. Both the vigor and the ingenuity of present-day connectionist 


theorists support the guess that they are most likely to provide the 
basis for the “theory of the future.” 

We certainly cannot rule out the possibility, however, that such
an integrative theory might be more like present-day cognitive 
theories. The language of perceptions and beliefs is a more natural 
one for most people than the language of stimulus-response connections,
and a theory based on such cognitive language might be
less cumbersome than the kind of theory discussed above. Like 
Lewin’s, such a theory would begin with our view of the world 
around us, and would treat learning as a modification in that view. 
It would, however, concern itself more than those theories do with 
the relation of the life space both to external stimuli and to overt 
responses. In this respect it would be more like Tolman’s theory, 
as well as somewhat similar to Mowrer’s latest interpretation. As 
regards external stimuli, it would have to include postulates about
the ways that changes in stimuli produce changes in the life 
space. These postulates would deal both with the long-run
changes in cognitions that we call learning and the short-run 
changes involving moods and momentary perceptions. They 
would also have to make allowance for differences in these effects 
according to heredity, age, and previous experience. As regards 
overt responses, the theory would have to recognize that cognitions
exist at different levels of analysis. A cognition about how
to get from one place to another includes not only a cognitive map 
of the route but also many tiny cognitions about the movements 
necessary to follow the route. These tiny cognitions are often ones 
that the individual is quite unable to describe, but they neverthe- 
less have to be included somehow in his life space in order to give 
a complete explanation of his behavior. Finally, the ideal cogni- 
tive theory would require some way of dealing with those cases 
where what an individual tells us about his attitudes and motives 
fails to agree with his life space as we infer it from his deeds. There 
is little indication at present that any cognitive theory is likely 
to solve these problems and achieve the necessary combination of 
accuracy and breadth to qualify as an adequate theory of learning. 
If one should, however, it might well be more useful than the kind 
of connectionist theory discussed above.

A third possibility for an ideal theory of learning is a combination
of the above two. Since cognitive theories deal best with complex
symbolic behavior and connectionist theories do best with
simpler behavior, a theory might make use of both interpretations.
Such a two-part system might be even more like Tolman’s
than the purely cognitive one discussed above. For all its apparent 
advantages, it is unlikely that such a system would have a lasting 
vogue. Man’s desire for consistency would keep urging that one 
kind of learning be explained in terms of the other. The result would 
probably be much like the ideal connectionist theory we have 
considered, in which cognitive processes are explained as the product
of complex stimulus-response connections.

A final possibility is that the feedback model will be elaborated 
to the point where it can satisfactorily deal with the whole range 
of learning. This approach has the advantage of being well 
adapted to topics as diverse as muscular coordination and the
attainment of life goals. It has, however, a long way to go before
it can fulfill the theoretical needs of th[...]
[...] to answer. How mi[...]
feedback mechanisms fail to kee[...]
[...]l one of them succeeds in bring[...]
[...]sponse is then learned as a [...]
[...]is interpretation of learning [...]
connectionist interpretation. [...]
[...]uires its particular directions [...]
feedback mechanisms makes less difference than that it makes
clear and accurate predictions about learning. It is by this criterion 
that we must judge different attempts to build a comprehensive 
theory of learning. 


THE PRESENT-DAY VALUE OF LEARNING THEORY 


Whatever form the theory of the future may take, most of us 
have to make the best of the theories we have today. Granted 
that all of them fall short of the ideals we have set for a truly ade- 
quate theory, what use can we make of them? For the most part, 
whatever exact predictions they make are applicable only to 
carefully controlled laboratory conditions. They can rarely predict 
directly to the complex, uncontrolled conditions of everyday life. 
Even with the most precise of scientific theories, engineers are 
needed who can use their ingenuity and experience to apply the 
theory to practical ends. Since theories of learning are not among 
the most precise, this “psychological engineering” is perhaps even 
more challenging than in other areas of science. Such “engineering” 
enters into many jobs, from animal training to psychological war- 
fare, but above all it is the province of the educator. 

Hilgard (1964) has argued that a single category of behavioral 
engineers or applied psychologists, in contrast to pure research 
psychologists, is much too simple. He suggests that we can distinguish
six steps along the scale from the “purest” of researchers
on learning at one extreme to the “purest” of educational appliers
at the other. The first three steps are all under the heading of
pure-science research, but vary in the extent to which the learners and
the learned materials resemble those in ordinary educational 
practice. The remaining three steps, which he groups under “tech- 
nological research and development,” are of more specific interest 
here. These are: step 4—“research conducted in special laboratory 
classrooms, with selected teachers”; step 5—“a tryout of the results 
of prior research in a ‘normal’ classroom with a typical teacher”; 
step 6—“developmental steps related to advocacy and adoption”
(Hilgard, 1964, p. 409). Here the research and development
proceed from asking, “Will it work under the best of conditions?”
in step 4, to “Will it work under typical conditions?” in step 5, to
“Will it be generally adopted by educators?” in step 6. 


The applied psychology of learning is important not only as a
way of putting theories to practical use but also as a way of
improving theories. Along with their other contributions, applied
studies help to determine the boundary conditions of theories. If
a theory based on laboratory data is used to make predictions in
an applied situation, and the predictions are not confirmed, this
event shows that the theory is not appropriate for that situation.
It may still, however, be a perfectly good theory for predicting in
other situations. In addition, applied studies reveal new laws that
may then be used in the modification of old theories or the
construction of new ones. Gagné (1962), for example, has presented
some principles of learning that he has found valuable in
military training. These principles are concerned with the hierarchical
arrangement of component tasks and sequences of operations
within a larger task. They are rather different from the principles
that have been emphasized by most theories, and might serve as a
basis for new theoretical work in the psychology of learning.

For most of us, the various learning theories have two chief val[...]
[...] us with a vocabulary and a conceptual [...]
[...]ting the examples of learning that we observe.
These are valuab[...]
[...]ne who is alert to the world [...]
[...]osely related [...]
[...] responses to be learned [...]
[...] they will be used, and also
to the value of practicing them under varied conditions if they 
are to be firmly established. Skinner advises us always to find out 
what reinforces a given act, so that we can present that reinforcer
if we want the act to occur or remove it if we want to extinguish
the act. Miller and Dollard warn us to consider the secondary
drives that may be learned in a situation and may [...]
the basis of new, often undesired, learning. Wertheimer and Köhler
point out the importance of arranging learning situations so as
to foster real, creative understanding rather than blind rote
memorization. Lewin suggests that we try to reconstruct the learner’s
life space, with his tensions and goals and the paths and barriers 
that he perceives. Tolman, Hull, and Estes offer many of these 
same suggestions in more technical form. All of these [...] require
ingenuity if they are to be put to practical use. Each, [...]
serves to emphasize some aspect of the learning process that we
would be wise to consider. Thus each serves both to enrich our
understanding of the learning situations we observe and to help us
find solutions to the practical learning problems with which we
have to deal. While many theorists aspire to make a [...] contribution
than this, and to some extent succeed in doing so, this contribution
alone is enough to make theories invaluable to the study
of learning.


SUGGESTIONS FOR FURTHER READINGS


A. GENERAL READINGS 


READERS WHO WANT to study the topics covered in this book more thor- 
oughly will find the following references helpful. 


Deese, J., and Hulse, S. H. The psychology of learning. 3rd ed. New 
York: McGraw-Hill, 1967. — Though not oriented toward theoretical 
systems, this book provides a broad coverage of the psychology of 
learning, including a number of theoretical issues and the data 
bearing on them. 

Goldstein, H., Krantz, D. L., and Rains, J. D. (Eds.) Controversial issues
in learning. New York: Appleton-Century-Crofts, 1965. — A collec- 
tion of original articles on eight controversies within learning theory. 
They are often difficult reading, but will give the reader a sense of 
what some of the arguments among learning theorists have been like. 

Hilgard, E. R., and Bower, G. H. Theories of learning. 3rd ed. New 
York: Appleton-Century-Crofts, 1966. — The standard secondary
reference on contemporary theories of learning, providing both 
authoritative descriptions and critical evaluations of the outstanding 
theories. The book as a whole is highly readable, though a few of the
chapters are quite technical.

Koch, S. (Ed.) Psychology: a study of a science. Vol. 2. General
systematic formulations, learning, and special processes. New York:
McGraw-Hill, 1959. — A collection of 12 readings on various modern
learning theories, most of them written by the theorists themselves.
The study of which this volume forms a part emphasizes certain topics
in the philosophy of science, with the result that the distinctive styles
of some of the authors are partly lost. However, this is the most
complete and up-to-date source book available. The chapters vary in
difficulty, but most of them are harder reading than Hilgard.

Spence, K. W., and Spence, J. T. (Eds.) The psychology of learning
and motivation. New York: Academic Press. Vol. 1, 1967; Vol. 2,
1968. — The original articles in these volumes give very recent
developments on such topics as the partial reinforcement effect,
curiosity motivation, and the nature of memory. (Vol. 3 is also
available, but is less relevant to learning theory.)

Woodworth, R. S. Contemporary schools of psychology. Rev. ed. New
York: Ronald, 1948. — A short, easy, secondary reference for the
recent historical background of contemporary psychological theory.




B. PARTICULAR THEORISTS 


Among the primary sources mentioned in the text, the following are 
especially recommended for their readability and their concern with the 
applications of learning theory. 


Dollard, J. C., and Miller, N. E. Personality and psychotherapy. New 
York: McGraw-Hill, 1950. — An outline of Miller’s learning theory, 
followed by its application to the development of personality and the 
learning and unlearning of neuroses. 

Guthrie, E. R. The psychology of learning. Rev. ed. New York: Harper, 
1952. — The author's interpretation of a number of learning situa- 
tions and his comments on several other theorists. 

Skinner, B. F. Science and human behavior. New York: Macmillan, 
1953. — A popular summary of the chief variables in learning as 
studied by the author, followed by the interpretation of organized so- 
cial behavior in terms of reinforced learning. 

Wertheimer, M. Productive thinking. New York: Harper, 1945. — A
consideration of education, problem-solving, and creativity from the
gestalt point of view.


C. A FICTIONAL COMMENTARY 


McConnell, J. Learning theory. In G. Conklin (Ed.), Great science 
fiction by scientists. New York: Crowell Collier, 1962. [...]
view of learning theory in the form of a science-fictio[...]
Since almost any learning theorist can interpret it as [...]
other learning theorist, the stor[...]
for everyone.


INDEX OF TOPICS AND PARTIAL GLOSSARY


FOR EACH TECHNICAL term, the reference that explains the meaning of 
the term is given in italics. In addition, a few terms are defined here for
quick reference or for increased clarity. However, the reader should
keep in mind that these are rough-and-ready definitions and that usually 
the best way to understand the meaning of a term is to read the discus- 
sion of it in context. 


A1 and A2 (the two possible response classes to which a stimulus element
may be conditioned), 181-193
A and B solutions, 102-106 
abstraction, 19-22 
acquired distinctiveness of cues, 170 
activity drive, 205 
annoyers, 58 
applied psychology, 229-230
arousal, 84, 138 
optimal, 206-208 
attention, 53, 188, 170, 225 


barriers, 107-115, 125, 151-152 
behavior modification, as form of psychotherapy, 77 
behaviorism, 32-40, 118, 127 
purposive, 116-130 
boundary conditions (those conditions which must apply in order for a 
given theory to be valid in a given situation), 160 


cathexes, 124-125 
closure, law of, 100-101 




cognition (a belief, attitude, or perception), 28, 118-130, 227 

cognitive (explaining behavior and learning in terms of cognition, q.v.)

cognitive maps, 123, 125, 168

computers, 196-199 

conditioned inhibition, 143-149, 156 

conditioned stimulus See conditioning, classical 

conditioning, classical (a form of learning in which two stimuli are pre- 
sented together and the response originally elicited by one of them, 
the unconditioned stimulus, comes to be elicited also by the other, 
the conditioned stimulus), 11-13, 35-37, 39, 55, 61, 165, 185, 224

consciousness, 32, 93-94, 107-108 

consummatory response (a response by which one takes in or uses up a 
reward), 208-210 

contact-hunger, 209-210 

contiguity (the occurrence of two events together), 162-164 


contingencies of reinforcement, 77 
control:
of behavior, 21
in experiments, 14-15
corollaries, 140
crying, treatment of, 77 
cue-producing responses, 170-174
cues (stimuli that serve to guide behavior), 83, 88, 137, 201, 204
acquired distinctiveness of, 170, 225
relevant and irrelevant, 172-173 
curiosity, 204 


cybernetics, 199-202, 228 


D (=drive, q.v.) 
demands, 119, 204, 226
detours, 151-153 
development, 224 
discrimination (learning to make different responses to similar stimuli),
10-12, 65, 147-149, 188, 201
distributed practice, 143
drive (an aroused state of an organism, which motivates action), 26,
80-82, 136-141, 155-156, 161-162, 188, 201-202, 204-206, 226
discrimination, 126
learned or secondary, 86-89, 143-145, 204
reduction of as reinforcement, 83, 138, 145, 162-167
relation to other concepts, 83-84
drugs, effects of, 69-70


education, 2-6, 14, 78-79, 102-104, 160, 229
by programmed instruction, 78-79 
effect, law of, 58 
Also see reinforcement 
emotion, 12-13, 33, 38-39, 63-64 
equivalence beliefs, 124-125 
error factors (systematic but wrong ways of responding in a problem 
situation), 171 
excitatory potential, 136, 140-142, 147-150, 151-156 
exercise, law of, 58 
expectancies See sign-gestalt expectations and field expectancies 
experiments, 17-19, 131-133 
exploration, 204 
extinction (the weakening of the tendency to make a response that 
occurs when the response is not followed by a reinforcer), 10-12,
51, 155-156, 188-189, 217


fear, learning of, 39, 86-90, 165-168, 204 

feedback, 199-202, 228-229 

field cognition modes, 125-126, 171, 224 

field expectancies, 125, 168 

figure-ground relationships, 95 

forgetting, 51-52, 101-102, 189-191 

fractional antedating goal reaction, 145-146, 150-154, 161 

frequency, principle of, 37-38, 42, 58 

frustration (the drive produced by not receiving an anticipated re- 
ward), 212-216 


generalization (the tendency for a response to occur to a new stimulus 
similar to one that was present during original learning), 10, 12, 
147-149, 187-188 


gestalt psychology, 93-116, 225 
goal, location of, 107-115, 121-122, 150-153 


habit-family hierarchy, 152, 224 
habit strength, 136-141, 152-156, 161-163 
habits (learned tendencies for a certain stimulus to elicit a certain 
response): 
methods of breaking, 47-51 
hypothesizing (assuming for theoretical purposes that something is true 
although it has not been definitely shown to be true), 25 
imitation, 84-86 

incentive motivation, 136, 139-141, 152-157, 160-163, 226 

individual differences, 78, 81, 126-127, 154 

input, 199-200 

insight (a sudden understanding of the solution to a problem), 7, 97-98, 
102-106, 150-153, 193, 224-225 

instinct, 33-34 

Ir (=reactive inhibition, q.v.) 


K (=incentive motivation, q.v.) 


language, 76, 174-177 

latent learning, 120-121 

laws, scientific (statements about the conditions under which certain 
events occur), 16-27 

learning, all-or-none models, 194-196 

learning curves (graphs showing how much has been learned at each 
stage of practice), 149-150, 186-187 
learning, definition of, 1-2 
learning sets, 170-172, 224 
life space, 107-115, 118, 125, 201, 227 
light-guessing experiments, 192 
logic, 105 
Logic Theorist, 198 


machina docilis, 197-198 

magazine training, 72 

maintaining stimuli, 45-46, 52 
maintenance schedule, 84 

maze learning, 13 

meaning, 174-177 

measurement, 14, 153-155 

mediating responses, 169-177, 226, 228 
memorization, 3, 7-8, 216-219, 224 
molar behavior, 118 

molecular behavior, 118 

mother surrogate, 205-206 
movement-produced stimuli, 53-54, 169 


neurosis, 86-90 
nonreversal shift, 172-173 


